ENGINEERED STERILE PLANTS AND USES THEREFOR

The present invention relates generally to plants and more particularly to woody
perennial plants such as eucalypts with reduced reproductive capacity. The present
invention is also directed to genetic constructs useful in the generation of woody
perennial plants with reduced reproductive capacity. The present invention is most
particularly directed to sterile trees of the genus Eucalyptus which are useful in the
establishment of wood lot plantations, in re-forestation projects and in developing
modified eucalyptus trees with beneficial properties.

Bibliographic details of the publications referred to in this specification are collected at
the end of the description. Sequence identity numbers (SEQ ID NOs) for the nucleotide
and amino acid sequences referred to in the specification are defined after the
bibliography.

Throughout this specification, unless the context requires otherwise, the word 
"comprise", or variations such as "comprises" or "comprising", will be understood to 
imply the inclusion of a stated element or integer or group of elements or integers but
not the exclusion of any other element or integer or group of elements or integers. 

Expanding urbanisation, the agricultural industry's requirement for more arable land and
the increase in demand for wood from a range of timber-related industries are
contributing to rapidly diminishing native forests. Apart from the serious economic
consequences of exhausting supply of this primary product, there are also serious
environmental concerns commensurate with the loss of native forests. There is a need,
therefore, to ensure adequate re-forestation and to establish wood lots of native trees in
both national and regional parks and on private land holdings. There is also a need to
develop new varieties of native trees with beneficial properties such as resistance to pest
infestation or reduced frost sensitivity.

Although a number of reproductive genes have been isolated from plants such as
Arabidopsis and Antirrhinum, until now, analogous genes had not been cloned from
eucalypts. In accordance with the present invention, the inventors have cloned a number
of reproductive genes involved in floral development in woody perennials such as
eucalypts. The isolation and characterization of these genes will enable the generation
of a series of eucalypts and other woody perennials with reduced reproductive capacity.
Such plants will be useful in, for example, establishing wood lots for the timber and
agricultural industries as well as in environmental planning. Furthermore, the availability
of sterile eucalypts provides for the opportunity to introduce genes or other genetic
sequences conferring new traits without the possibility that these traits are passed onto
other trees in the event that the traits are deemed undesirable. The techniques developed
in accordance with the present invention will allow the generation of a range of
transgenic eucalypt trees exhibiting a variety of beneficial properties.

Accordingly, one aspect of the present invention contemplates a nucleic acid molecule
comprising a sequence of nucleotides encoding or complementary to a sequence
encoding a product or part, fragment, mutant, analogue or homologue thereof required
for normal flower development in a woody perennial. For convenience, a part,
fragment, mutant, analogue or homologue of the above-mentioned product is referred
to herein as a "derivative". Accordingly, the present invention provides an isolated nucleic
acid molecule comprising a sequence of nucleotides encoding or complementary to a
sequence encoding a product or derivative thereof required for normal flower
development in a woody perennial.

Preferably, the woody perennial is a species of the genus Eucalyptus. Examples of
eucalypt trees contemplated by the present invention include but are not limited to one
or more of Eucalyptus alpina, Eucalyptus annulata, Eucalyptus astringens, Eucalyptus
behriana, Eucalyptus benthamii, Eucalyptus botryoides, Eucalyptus brockwayi,
Eucalyptus caesia, Eucalyptus calophylla 'Rosea', Eucalyptus calycogona, Eucalyptus
camaldulensis, Eucalyptus camaldulensis ssp. subcinerea, Eucalyptus campaspe,
Eucalyptus camphora, Eucalyptus citriodora, Eucalyptus cladocalyx, Eucalyptus
cladocalyx 'Nana', Eucalyptus cloeziana, Eucalyptus cornuta, Eucalyptus cosmophylla,
Eucalyptus crenulata, Eucalyptus crucis, Eucalyptus delegatensis, Eucalyptus
desmondensis, Eucalyptus dielsii, Eucalyptus diptera, Eucalyptus diversicolor, Eucalyptus
diversifolia, Eucalyptus dumosa, Eucalyptus dundasii, Eucalyptus dunnii, Eucalyptus
eremophila, Eucalyptus erythrocorys, Eucalyptus erythronema, Eucalyptus ficifolia,
Eucalyptus flocktoniae, Eucalyptus foecunda, Eucalyptus forrestiana, Eucalyptus
gardneri, Eucalyptus gillii, Eucalyptus globulus, Eucalyptus globulus ssp. globulus,
Eucalyptus globulus ssp. pseudoglobulus, Eucalyptus globulus ssp. maidenii, Eucalyptus
globulus ssp. bicostata, Eucalyptus gomphocephala, Eucalyptus gracilis, Eucalyptus
grandis, Eucalyptus griffithsii, Eucalyptus grossa, Eucalyptus henrii, Eucalyptus
incrassata, Eucalyptus kitsoniana, Eucalyptus kondininensis, Eucalyptus kruseana,
Eucalyptus lansdowneana, Eucalyptus largiflorens, Eucalyptus lehmannii, Eucalyptus
leucoxylon, Eucalyptus leucoxylon (Pink) (syn. 'Rosea'), Eucalyptus lesouefii, Eucalyptus
macrandra, Eucalyptus macrocarpa, Eucalyptus maculata, Eucalyptus marginata,
Eucalyptus melliodora, Eucalyptus microcarpa, Eucalyptus nicholii, Eucalyptus nitens,
Eucalyptus nutans, Eucalyptus obliqua, Eucalyptus occidentalis, Eucalyptus ochrophloia,
Eucalyptus oleosa, Eucalyptus orbifolia, Eucalyptus ovata, Eucalyptus pauciflora,
Eucalyptus parvifolia, Eucalyptus perriniana, Eucalyptus pileata, Eucalyptus pellita,
Eucalyptus pilularis, Eucalyptus platypus, Eucalyptus polyanthemos, Eucalyptus porosa,
Eucalyptus preissiana, Eucalyptus pyriformis, Eucalyptus regnans, Eucalyptus resinifera,
Eucalyptus rhodantha, Eucalyptus robusta, Eucalyptus saligna, Eucalyptus
salmonophloia, Eucalyptus salubris, Eucalyptus sargentii, Eucalyptus sepulcralis,
Eucalyptus sideroxylon 'Rosea', Eucalyptus sieberi, Eucalyptus spathulata, Eucalyptus
socialis, Eucalyptus steedmanii, Eucalyptus stoatei, Eucalyptus stricklandii, Eucalyptus
tereticornis, Eucalyptus tetragona, Eucalyptus tetraptera, Eucalyptus torquata,
Eucalyptus 'Torwood', Eucalyptus urnigera, Eucalyptus urophylla, Eucalyptus viminalis,
Eucalyptus viridis, Eucalyptus wandoo, Eucalyptus websteriana and Eucalyptus woodwardii.
The species of Eucalyptus may be selected from any or all of the above-listed species
or from other species of Eucalyptus or from species related to eucalypts or which have
eucalypt-like properties. Preferably, the eucalypt tree is Eucalyptus benthamii,
Eucalyptus botryoides, Eucalyptus camaldulensis, Eucalyptus citriodora, Eucalyptus
cloeziana, Eucalyptus delegatensis, Eucalyptus diversicolor, Eucalyptus dunnii,
Eucalyptus globulus, Eucalyptus globulus subspecies bicostata, Eucalyptus globulus
subspecies globulus, Eucalyptus globulus subspecies maidenii, Eucalyptus globulus
subspecies pseudoglobulus, Eucalyptus grandis, Eucalyptus henrii, Eucalyptus maculata,
Eucalyptus marginata, Eucalyptus nitens, Eucalyptus obliqua, Eucalyptus pellita,
Eucalyptus pilularis, Eucalyptus regnans, Eucalyptus resinifera, Eucalyptus saligna,
Eucalyptus sieberi, Eucalyptus tereticornis, Eucalyptus urophylla or Eucalyptus viminalis.
Most preferably, the eucalypt tree is Eucalyptus globulus ssp. globulus or Eucalyptus
nitens.

The nucleic acid is generally composed of deoxyribonucleotides and may be genomic
DNA or cDNA. The nucleotide sequence of the nucleic acid molecule may correspond
exactly with the nucleotide sequence of the naturally occurring gene or may contain one
or more nucleotide substitutions, deletions and/or additions. Where the DNA is genomic
DNA, the DNA may or may not contain introns.

In a particularly preferred embodiment, the reproductive gene is related in sequence
and/or function to a corresponding gene in Arabidopsis and/or Antirrhinum.

Even more particularly, the reproductive gene corresponds to a gene selected from leafy
(LFY), apetala 1 (AP1) or agamous (AG) from Arabidopsis or floricaula (FLO),
squamosa (SQUA) or plena (PLE) from Antirrhinum.

In a most preferred embodiment, the present invention provides an isolated DNA
molecule selected from the following:

(i) having a sequence set forth in SEQ ID NO:1 or 3 or 5 and analogous to LFY
and/or FLO;

(ii) having a sequence set forth in SEQ ID NO:9 or 11 or 13 and analogous to AP1
and/or SQUA; or

(iii) having a sequence set forth in SEQ ID NO:7 or 15 and analogous to AG and/or
PLE.

The eucalypt genes defined in (i), (ii) and (iii) above are referred to herein as FLE, SQE
and AGE, respectively.

According to preferred embodiments of the present invention, there is provided in one
aspect, a DNA molecule of eucalypt origin encoding a reproductive gene and capable
of hybridizing under low stringency conditions to a nucleic acid molecule having a
nucleotide sequence as set forth in or complementary to SEQ ID NO:1 or 3 or 5.

In another aspect, there is provided a DNA molecule of eucalypt origin encoding a
reproductive gene and capable of hybridizing under low stringency conditions to a
nucleic acid molecule having a nucleotide sequence as set forth in or complementary to
SEQ ID NO:9 or 11 or 13.

In still another aspect, there is provided a DNA molecule of eucalypt origin encoding
a reproductive gene and capable of hybridizing under low stringency conditions to a
nucleic acid molecule having a nucleotide sequence as set forth in or complementary to
SEQ ID NO:7 or 15.

For the purposes of defining the level of stringency, reference can conveniently be made 
to Sambrook et al (1989) which is herein incorporated by reference where the washing 
steps at pages 9.52-9.57 are considered high stringency. A low stringency is defined 
herein as being in 0.1-0.5% w/v SDS at 37-45°C for 2-3 hours. Depending on the 
source and concentration of nucleic acid involved in the hybridisation, alternative 
conditions of stringency may be employed such as medium stringent conditions which 
are considered herein to be 0.25%-0.5% w/v SDS at > 45°C for 2-3 hours or high 
stringent conditions as disclosed by Sambrook et al (1989). 
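
By way of illustration only, the low and medium stringency definitions given above can be restated as a small lookup. The following is a minimal sketch in Python; the function name `classify_wash` and its return labels are hypothetical and do not appear in the specification, and high stringency is deliberately not encoded because it is defined only by reference to Sambrook et al (1989).

```python
def classify_wash(sds_percent: float, temp_c: float, hours: float) -> str:
    """Label a wash step using only the thresholds stated in the text.

    low:    0.1-0.5% w/v SDS at 37-45 deg C for 2-3 hours
    medium: 0.25-0.5% w/v SDS at >45 deg C for 2-3 hours
    """
    if not 2 <= hours <= 3:
        return "outside stated duration"
    if 0.25 <= sds_percent <= 0.5 and temp_c > 45:
        return "medium"
    if 0.1 <= sds_percent <= 0.5 and 37 <= temp_c <= 45:
        return "low"
    # Anything else (including high stringency) is not defined numerically here.
    return "undefined in text"


if __name__ == "__main__":
    print(classify_wash(0.2, 40, 2))    # falls in the low stringency range
    print(classify_wash(0.3, 50, 2.5))  # falls in the medium stringency range
```

Note that a wash at exactly 45°C is treated as low stringency, since medium stringency is stated as above 45°C.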

In an alternative embodiment, the preferred nucleic acid molecule of the present
invention comprises a nucleotide sequence having at least about 45%, more preferably
at least about 55%, still more preferably at least about 65%, yet still more preferably at
least about 75-80% and even still more preferably at least about 85-95% nucleotide
similarity to at least one of SEQ ID NO:1 or 3 or 5, SEQ ID NO:9 or 11 or 13 or SEQ
ID NO:7 or 15.
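
The specification does not state how nucleotide similarity is to be calculated. Purely as an illustrative sketch, the following Python example assumes that similarity is taken as percent identity over two sequences that have already been aligned (gaps written as "-"), and reports the highest of the stated preference tiers that a comparison satisfies. The names `percent_identity`, `highest_tier` and `TIERS` are hypothetical, and the tiers use the lower bound of each stated range.

```python
# Lower bounds of the preference tiers stated in the text (45, 55, 65, 75-80, 85-95).
TIERS = (45, 55, 65, 75, 85)


def percent_identity(a: str, b: str) -> float:
    """Percent identical positions between two pre-aligned sequences.

    Assumption: columns where both sequences have a gap are ignored;
    a gap opposite a base counts as a mismatch.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == "-" and y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def highest_tier(pct: float):
    """Return the highest stated threshold met, or None if below 45%."""
    met = [t for t in TIERS if pct >= t]
    return max(met) if met else None


if __name__ == "__main__":
    pct = percent_identity("ACGTACGT", "ACGAACGT")
    print(pct, highest_tier(pct))
```

In practice an alignment program would be used to produce the aligned input; that step is outside this sketch.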

The present invention also contemplates a method for identifying a gene required for
normal flower development in a woody perennial, said method comprising isolating a
nucleic acid molecule from said woody perennial and cloning a region of said nucleic
acid molecule having at least 45% nucleotide similarity to all or a region of Arabidopsis
genes LFY, AP1 or AG or Antirrhinum genes FLO, SQUA or PLE.

Preferably, the woody perennial is a species of the genus Eucalyptus such as Eucalyptus
benthamii, Eucalyptus botryoides, Eucalyptus camaldulensis, Eucalyptus citriodora,
Eucalyptus cloeziana, Eucalyptus delegatensis, Eucalyptus diversicolor, Eucalyptus
dunnii, Eucalyptus globulus, Eucalyptus globulus subspecies bicostata, Eucalyptus
globulus subspecies globulus, Eucalyptus globulus subspecies maidenii, Eucalyptus
globulus subspecies pseudoglobulus, Eucalyptus grandis, Eucalyptus henrii, Eucalyptus
maculata, Eucalyptus marginata, Eucalyptus nitens, Eucalyptus obliqua, Eucalyptus
pellita, Eucalyptus pilularis, Eucalyptus regnans, Eucalyptus resinifera, Eucalyptus
saligna, Eucalyptus sieberi, Eucalyptus tereticornis, Eucalyptus urophylla or Eucalyptus
viminalis. Most preferably, the species is Eucalyptus globulus ssp. globulus or
Eucalyptus nitens.

The nucleic acid molecules of the present invention are useful for generating antisense
or ribozyme molecules, for overexpression of sense molecules using a constitutive
promoter, or for co-suppression of the subject reproductive genes. By targeting such
genes, their expression is diminished, reduced or otherwise lowered to levels which
result in reproductive sterility or at least substantially reduced reproductive capacity.

According to one embodiment there is provided a nucleic acid molecule comprising at
least 5 contiguous nucleotide bases capable of hybridizing to, forming a duplex with or
otherwise reducing translation of target reproductive gene-specified mRNA. This
embodiment of the present invention defines primers, probes and other antisense
molecules capable of inhibiting or at least reducing translation of the reproductive gene-
specified mRNA. Although the preferred antisense molecules are at least about 10 to
20 nucleotides in length, the present invention extends to molecules of 50-100 nucleotide
bases in length or a molecule corresponding to a full length or substantially full length
reproductive gene-specified mRNA.
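
As a minimal sketch of what is meant by an antisense molecule complementary to part of a target mRNA, the following Python example derives the reverse complement of a chosen window of an mRNA sequence. The function name `antisense_rna` and the example sequence are hypothetical and are not taken from the sequence listing; the minimum length of 5 bases reflects the figure stated above, and no claim is made that any particular window would be effective in vivo.

```python
# Watson-Crick complement for RNA bases.
_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(mrna: str, start: int, length: int) -> str:
    """Return the antisense RNA (written 5'->3') for mrna[start:start+length].

    DNA-style input is accepted: T is read as U.
    """
    if length < 5:
        raise ValueError("text contemplates at least 5 contiguous bases")
    target = mrna.upper().replace("T", "U")
    window = target[start:start + length]
    if len(window) != length:
        raise ValueError("window extends past the end of the sequence")
    if set(window) - set("ACGU"):
        raise ValueError("unexpected character in sequence")
    # Complement each base, then reverse so the result reads 5'->3'.
    return window.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    example_mrna = "AUGGCCAAGUUCGGA"  # hypothetical sequence for illustration
    print(antisense_rna(example_mrna, 0, 10))
```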

Co-suppression involves introducing a second copy of an endogenous gene or a 
substantially similar copy or analogue of an endogenous gene. Over expression involves 
introducing a second copy of an endogenous gene or a substantially similar copy or 
analogue of an endogenous gene fused to a constitutive promoter. 






Ribozymes are synthetic RNA molecules which possess highly specific endoribonuclease
activity. In particular, they comprise a hybridizing region which is complementary in
nucleotide sequence to at least part of a target RNA. Ribozymes are well described by
Haseloff and Gerlach (1988) and in International Patent Application No. WO 89/05852.
The present invention extends to ribozymes which target reproductive gene mRNA.

According to this embodiment, there is provided a ribozyme comprising a hybridizing
region and a catalytic region wherein the hybridizing region is capable of hybridizing
to at least part of a target mRNA sequence transcribed from a reproductive gene and
more particularly as defined by SEQ ID NO:1 or 3 or 5, SEQ ID NO:9 or 11 or 13 or
SEQ ID NO:7 or 15, wherein said catalytic region is capable of cleaving said target
mRNA sequence to reduce or substantially inhibit reproductive capacity in a woody
perennial and more particularly a eucalypt.

The present invention extends to genetic constructs designed to facilitate expression of
the antisense molecule, co-suppression molecule, overexpression molecule or ribozyme.

Generally, the genetic construct comprises in addition to the subject nucleic acid
molecule, a promoter and optionally other regulatory sequences to facilitate expression
of the nucleic acid molecule. The promoter may be derived from a genomic clone
encoding the reproductive gene or may be a heterologous promoter from another source.
Preferably, however, the promoter is tissue specific, capable of expression substantially
only in reproductive tissue for antisense, ribozyme and co-suppression molecules. For

Yet another aspect of the present invention contemplates a method of minimising
inflorescence in a woody perennial such as a eucalypt, said method comprising
introducing into one or more cells of said woody perennial a genetic construct capable
of down regulating expression of a reproductive gene in said woody perennial,
regenerating a transgenic plant from said one or more cells and growing said plant for
a time and under conditions sufficient to permit expression of said genetic construct.

Preferably, the woody perennial is a eucalypt, such as selected from the previously
mentioned list. Preferably, the eucalypt is Eucalyptus benthamii, Eucalyptus
botryoides, Eucalyptus camaldulensis, Eucalyptus citriodora, Eucalyptus cloeziana,
Eucalyptus delegatensis, Eucalyptus diversicolor, Eucalyptus dunnii, Eucalyptus
globulus, Eucalyptus globulus subspecies globulus, Eucalyptus globulus subspecies
bicostata, Eucalyptus globulus subspecies maidenii, Eucalyptus globulus subspecies
pseudoglobulus, Eucalyptus grandis, Eucalyptus henrii, Eucalyptus maculata, Eucalyptus
marginata, Eucalyptus nitens, Eucalyptus obliqua, Eucalyptus pellita, Eucalyptus
pilularis, Eucalyptus regnans, Eucalyptus resinifera, Eucalyptus saligna, Eucalyptus
sieberi, Eucalyptus tereticornis, Eucalyptus urophylla or Eucalyptus viminalis. Most
preferably the eucalypt is Eucalyptus globulus ssp. globulus or Eucalyptus nitens.

Preferably, the reproductive gene corresponds to one or more of SEQ ID NO:1 or 3 or
5, SEQ ID NO:9 or 11 or 13 or SEQ ID NO:7 or 15.

Preferably, the transgenic plant exhibits a substantial reduction in reproductive capability. 
More preferably, the plant is substantially sterile. 

Still yet a further aspect of the present invention is directed to a transgenic eucalypt
plant such as a eucalypt selected from the previously mentioned list wherein the plant
exhibits substantially reduced reproductive capacity.

The present invention is further described by the following non-limiting Figures and/or
Examples.

In the Figures: 

Figure 1 is a representation of the nucleotide sequence of genomic clone FLE2 with the
predicted amino acid sequences. The positions of the PCR primers used to isolate the
gene are underlined.

Figure 2 is a representation of the nucleotide sequence of AGE1 cDNA with its predicted
amino acid sequence. The MADS box and position of the PCR primers used to isolate
the gene are underlined.

Figure 3 is a representation of the nucleotide sequence of SQE1 cDNA. The MADS box
is underlined; the amino acids at the positions of the PCR primers used to isolate the
gene are in italics.

Figure 4 is a representation of the nucleotide sequence of SQE2S cDNA with its
predicted amino acid sequence. The MADS box is single underlined; the amino acids
at the positions of the PCR primers used to isolate the genes are double underlined.

Figure 5 is a representation of the nucleotide sequence of SQE2L cDNA with its
predicted amino acid sequence. The MADS box is single underlined; the amino acids
at the positions of the PCR primers used to isolate the genes are double underlined.

Figure 6 is a representation of the nucleotide sequence of AGE2 with its predicted amino
acid sequence. The MADS box is single underlined; the amino acids at the positions
of the PCR primers used to isolate the genes are double underlined.

Single and three letter abbreviations used for amino acid residues in the specification are
defined in Table 1.

TABLE 1

Amino Acid        Three-letter      One-letter
                  Abbreviation      Symbol

Alanine           Ala               A
Arginine          Arg               R
Asparagine        Asn               N
Aspartic acid     Asp               D
Cysteine          Cys               C
Glutamine         Gln               Q
Glutamic acid     Glu               E
Glycine           Gly               G
Histidine         His               H
Isoleucine        Ile               I
Leucine           Leu               L
Lysine            Lys               K
Methionine        Met               M
Phenylalanine     Phe               F
Proline           Pro               P
Serine            Ser               S
Threonine         Thr               T
Tryptophan        Trp               W
Tyrosine          Tyr               Y
Valine            Val               V

A summary of SEQ ID NOs used throughout the specification is shown in Table 2.

TABLE 2

SEQ ID NO        DEFINITION

SEQ ID NO:1      Genomic clone of FLE
SEQ ID NO:2      Predicted amino acid sequence of FLE-1
SEQ ID NO:3      Genomic clone of FLE
SEQ ID NO:4      Predicted amino acid sequence of FLE-2
SEQ ID NO:5      Genomic clone of FLE
SEQ ID NO:6      Predicted amino acid sequence of FLE-3
SEQ ID NO:7      cDNA AGE1 clone
SEQ ID NO:8      Predicted amino acid sequence of AGE1
SEQ ID NO:9      cDNA SQE1 clone
SEQ ID NO:10     Predicted amino acid sequence of SQE1
SEQ ID NO:11     cDNA SQE2S clone
SEQ ID NO:12     Predicted amino acid sequence of SQE2S
SEQ ID NO:13     cDNA SQE2L clone
SEQ ID NO:14     Predicted amino acid sequence of SQE2L
SEQ ID NO:15     cDNA AGE2 clone
SEQ ID NO:16     Predicted amino acid sequence of AGE2

The Arabidopsis and Antirrhinum genes and the names of their eucalypt homologues
are shown in Table 3.

TABLE 3

Arabidopsis         Antirrhinum          Eucalyptus

Leafy (LFY)         Floricaula (FLO)     FLE
Apetala 1 (AP1)     Squamosa (SQUA)      SQE
Agamous (AG)        Plena (PLE)          AGE


EXAMPLE 1

Isolation of the eucalypt genes

The strategy used to isolate the eucalypt genes involved design and synthesis of
polymerase chain reaction (PCR) primers to regions of amino acid homology between
the Arabidopsis and Antirrhinum genes. These primers were then used to amplify from
eucalypt DNA (FLE) or cDNA (AGE and SQE) short regions of DNA from the eucalypt
homologue. The DNA fragments obtained were then used as probes to screen
Eucalyptus globulus libraries to obtain the full length genes.
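
The primer sequences actually used are not given in this part of the specification. As a hedged illustration of how a primer may be designed to a conserved stretch of amino acids, the following Python sketch back-translates a short peptide into a degenerate oligonucleotide using the standard genetic code and IUPAC ambiguity codes, and reports its degeneracy. The function names and the example peptide are hypothetical. For amino acids encoded by six codons (Leu, Arg, Ser) the single degenerate codon produced here is over-inclusive, which is a simplification a practitioner would normally refine.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = {}
for (b1, b2, b3), aa in zip(product(BASES, repeat=3), AMINO):
    CODONS.setdefault(aa, []).append(b1 + b2 + b3)

# IUPAC ambiguity codes keyed by the alphabetically sorted set of bases.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "AG": "R", "CT": "Y", "CG": "S", "AT": "W", "GT": "K", "AC": "M",
    "CGT": "B", "AGT": "D", "ACT": "H", "ACG": "V", "ACGT": "N",
}
SIZES = {code: len(bases) for bases, code in IUPAC.items()}


def degenerate_codon(aa: str) -> str:
    """One degenerate codon covering every codon for the amino acid."""
    codons = CODONS[aa]
    return "".join(
        IUPAC["".join(sorted({c[i] for c in codons}))] for i in range(3)
    )


def degenerate_primer(peptide: str) -> str:
    """Back-translate a peptide (one-letter codes) into a degenerate oligo."""
    return "".join(degenerate_codon(aa) for aa in peptide.upper())


def degeneracy(primer: str) -> int:
    """Number of distinct sequences represented by a degenerate oligo."""
    n = 1
    for ch in primer:
        n *= SIZES[ch]
    return n


if __name__ == "__main__":
    peptide = "MGRGK"  # hypothetical conserved motif, for illustration only
    oligo = degenerate_primer(peptide)
    print(oligo, degeneracy(oligo))
```

Regions rich in Met and Trp, which each have a single codon, give the least degenerate primers, which is one reason such residues are often favoured when choosing where to place a primer.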

EXAMPLE 2

FLE

Three copies of FLE were isolated from a Eucalyptus globulus genomic library. Two
of these copies were not functional because the sequences were interrupted by stop
codons and frame shifts. These two copies were found to be two alleles at a pseudogene
locus. The other copy appears to be the homologue of the LFY and FLO genes in
eucalypts. The predicted protein product of FLEl is 359 amino acids in length and 
shares 67% identical amino acids with LFY and 71% identical amino acids with FLO. The nucleotide
sequence (SEQ ID NO:1) and the predicted amino acid sequences (SEQ ID NO:2, 4 and
6) of the genomic clone of FLE2 are shown in Figure 1.

Expression of FLE2 RNA was determined by in situ hybridization. FLE2 RNA was
observed in developing floral organs in a similar manner to the expression patterns of
FLORICAULA and LEAFY. In addition, FLE2 is strongly expressed in the primordia
of true leaves as they develop on the flanks of apical and lateral vegetative meristems.
Expression was not detected in the centre of the vegetative apical meristem or in stems
or roots. Its observed expression in vegetative and floral tissue possibly suggests a
wider role for this gene in Eucalyptus development.

The promoter of the FLE2 gene was also isolated from the genomic library. One
kilobase (kb) of the promoter has been sequenced and 2 kb have been used in sterility
constructs.

EXAMPLE 3
AGE1 and AGE2

Full length cDNA clones of AGE1 were isolated from a cDNA library constructed from
RNA from mature flower buds. The gene encodes a protein which is 251 amino acids
in length with amino acid homology between the AGE1 protein and the AG protein of
about 68% and with the PLE protein, about 75%. The AG and PLE proteins contain a
conserved region of 56 amino acids known as the MADS box region. This region is
present in a number of different genes involved in floral development. The AGE1 gene
also contains this region of 56 amino acids which is identical to that of the AG MADS
box. Two of the 56 amino acids differ between PLE and AGE1. Southern blots of
Eucalyptus globulus DNA probed with a fragment of AGE1 cDNA which does not
contain the MADS box suggest that there is only one copy of the AGE1 gene in the
eucalypt genome. The nucleotide sequence of the AGE1 cDNA and the predicted amino
acid sequence are shown in Figure 2. The nucleotide sequence of another cDNA clone,
AGE2, is shown in Figure 6 with its predicted amino acid sequence. AGE2 is 229
amino acids in length. It has good homology to both the AG and PLE proteins, of 62%
and 64%, respectively.

The fact that both AGE1 and AGE2 were isolated from a cDNA library is a good
indication that these genes are functional. An RNA probe made from AGE1 was used
to screen RNA isolated from mature, medium and young flower buds, anthers, leaves,
stems and roots of Eucalyptus globulus. The hybridisation pattern suggests that AGE1
is expressed very highly in mature flower buds and not at all in leaves, stems or roots.
The level of expression in anthers and medium flower buds is moderate. There is very
low, if any, expression in young flower buds. This pattern of expression is very similar
to that of AG and PLE. AGE2, however, is strongly expressed only in anthers, young
flower buds and mature flower buds. It is weakly expressed in leaves.

In order to isolate the AGE promoter, a Eucalyptus globulus genomic DNA library was
screened with a fragment of AGE cDNA that does not contain the MADS region to
avoid cross-hybridisation with other MADS-containing genes. A number of clones have
been identified. The promoter is used to generate a series of sterility constructs in
Arabidopsis and Eucalyptus.

EXAMPLE 4
SQE1 and SQE2

The fragment of the SQE1 and SQE2 genes amplified from cDNA was from within the
MADS box. When a cDNA library constructed from RNA from young flower buds was
screened with this probe, four clones were isolated. One of these genes, SQE1, contains
a MADS box with 95% homology to AP1 and 91% homology to SQUA. Sequencing
of the gene has revealed 69% homology to AP1 at the DNA level. The DNA sequence
and predicted amino acid sequence of SQE1 are shown in Figure 3. Other genes isolated
are designated SQE2S having a DNA sequence and predicted amino acid sequence
shown in Figure 4 (SEQ ID NOs 11 and 12, respectively) and SQE2L having a DNA
sequence and predicted amino acid sequence shown in Figure 5 (SEQ ID NOs 13 and
14, respectively). These genes will be investigated for their potential for use in the
sterility strategy.

The promoters of the genes may be used to generate sterility constructs.

EXAMPLE 5

Ectopic Expression of the Eucalypt genes in Arabidopsis
confirms their functional similarity to the Arabidopsis genes

To examine the function of the two genes in flower development, AGE1 and AGE2
cDNAs were ectopically expressed (also known as overexpression) in Arabidopsis plants
under the control of the cauliflower mosaic virus 35S promoter. More than 60
independent transgenic plants were regenerated for each construct. The most commonly
observed phenotype in the primary transformants was that the sepals developed stigmatic
papillae, the petals became staminoid or were shorter than normal, while the stamens and
carpels appeared normal or occasionally the carpels were unfused. Most of the plants,
however, were sterile. 25% of 35S-AGE1 and 29% of 35S-AGE2 transformants showed
this phenotype. Some plants showed a severe phenotype while others showed a weaker
one. As most plants were sterile it was impossible to test the segregation of the
transgene in the next generation. Of the plants that were fertile, most appeared to
produce a normal phenotype in the next generation, suggesting that there was very weak
expression of the gene. Based on the phenotype of the primary transformants it appears
that (1) both the AGE1 and AGE2 genes from Eucalyptus are functional in Arabidopsis;
(2) that they have a similar function to the AG gene of Arabidopsis; and (3) high levels
of expression of either or both the AGE1 and AGE2 genes in transgenic Eucalyptus may
lead to sterile plants. Thus, expression of flowering genes using the CaMV35S promoter
to cause disruption of flowering is an alternative strategy to that of antisense, co-
suppression or RNase genes.

To examine the function of the two eucalypt genes (SQE1 and SQE2L) in flower
development, and the Arabidopsis gene, AP1, the three cDNAs were ectopically
expressed in Arabidopsis plants under the control of the CaMV35S promoter. More than
50 independent transgenic plants were regenerated for each construct. The most
commonly observed phenotype in the primary transformants was a homeotic conversion
of stem meristems into flowers - these plants resembled the Arabidopsis mutant terminal
flower (tfl). Development of the stem meristem, which is normally indeterminate, stops
early in transgenic plants with a severe tfl-like phenotype. Instead they form an
abnormal flower at the top of the stems. The terminal flower consists of the
disorganised aggregation of many floral organs. 30%, 43% and 30% of the transformed
plants carrying 35S-AP1, 35S-SQE1 and 35S-SQE2L, respectively, showed the tfl-like
phenotype. Although the severity and the details of the phenotype varied between
independent transgenic plants, the overall ranges of phenotypes observed seemed to be
similar between the three different constructs.

The phenotype was analysed in detail in the T1 generation grown on soil. The tfl-like
phenotype observed in the T0 generation was inherited in the next generation as a
dominant Mendelian trait and it co-segregated with kanamycin resistance. Table 4
summarises the characteristics of six independent transgenic lines. T1 transgenic plants
flowered earlier compared to control plants. They formed an increased number of stems
and were shorter than control plants.

These results indicate that the Eucalyptus genes, SQE1 and SQE2L, have the same
function as the Arabidopsis AP1 gene in flower development.

TABLE 4

Comparison of flowering time, number of stems and height of main stem
in transgenic (T1) and wild type (C24) Arabidopsis plants.

Transgenic line         Days to       Number of     Height of main
                        flowering     stems         stem (cm)

35S-AP1      A15        17.0          2.0           5.2
             A52        14.0          3.0           3.5
35S-SQE1     S1-23      16.0          5.5           2.5
             S1-25      13.0          4.6           2.0
35S-SQE2L    S2-16      12.0          1.8           2.3
             S2-28      13.5          5.3           2.4
Control                 19.7          1.2           18.8
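
The comparison drawn from Table 4 can be checked with simple arithmetic. The following Python sketch averages the two lines reported for each construct and compares the result with the control row; the values are transcribed from Table 4, while the variable and function names are hypothetical. With only two lines per construct this is a descriptive summary, not a statistical test.

```python
from statistics import mean

# (days to flowering, number of stems, height of main stem in cm)
CONTROL = (19.7, 1.2, 18.8)

# Values transcribed from Table 4: (line, days, stems, height_cm)
LINES = {
    "35S-AP1": [("A15", 17.0, 2.0, 5.2), ("A52", 14.0, 3.0, 3.5)],
    "35S-SQE1": [("S1-23", 16.0, 5.5, 2.5), ("S1-25", 13.0, 4.6, 2.0)],
    "35S-SQE2L": [("S2-16", 12.0, 1.8, 2.3), ("S2-28", 13.5, 5.3, 2.4)],
}


def construct_means(lines=LINES):
    """Mean of each measurement across the lines of each construct."""
    out = {}
    for construct, rows in lines.items():
        days, stems, height = zip(*(r[1:] for r in rows))
        out[construct] = (mean(days), mean(stems), mean(height))
    return out


if __name__ == "__main__":
    for construct, (days, stems, height) in construct_means().items():
        print(
            f"{construct}: flowers {CONTROL[0] - days:.2f} days earlier, "
            f"{stems - CONTROL[1]:.2f} more stems, "
            f"{CONTROL[2] - height:.2f} cm shorter than control"
        )
```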



FXAMPLE 5 
Genetic Constructs Conferring Sterility : in vivo 
model using Arabidopsis 

A number of constructs are tested for their ability to prevent flowering in Arabidopsis
(Table 5). Constructs are transformed into wild-type Arabidopsis and plants regenerated.
One plant, from construct 1, independent transgenic line 1, had a phenotype very similar
to that of the LEAFY mutant which exhibits female fertility but is completely male
sterile. This result strongly indicates that an antisense approach is capable of mimicking
the mutant phenotypes observed in Arabidopsis. This plant was cross-pollinated and a
small number of seeds collected for further analysis. One other plant from construct 1
had a similar but less pronounced phenotype. Plants from all other lines of construct
1 and the other constructs have been allowed to self-pollinate, and seed from the T0 plants
has been collected and planted out in soil. Seed of seven independent (T0) transgenic
lines was planted out from construct 1. One line (1-17) showed several plants (T1)
with an altered phenotype. The T0 plant that produced this seed appeared normal. One
of the eleven plants from this line (1-17) has a phenotype very similar to that of the
strongest allele of the LEAFY mutant which is male-sterile with very much reduced
female fertility. Nine plants have phenotypes similar to the mutants of the weaker
alleles of LEAFY. These eleven plants will be further analysed. These results strongly
support the hypothesis that we can produce sterile plants by introducing antisense to
these genes into transgenic plants.

Sterility constructs are made with antisense to LEAFY, AP1, and both LEAFY and AP1
in the one construct. These antisense constructs are expressed by the histone promoter
H4A or the CaMV 35S promoter which has been observed to give high expression in
young flower buds.
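
In an antisense construct the gene fragment is placed in reverse orientation relative to the promoter, so the transcript is the reverse complement of the sense strand. The sketch below illustrates only that sequence relationship. The function name `reverse_complement` and the short example fragment are illustrative assumptions, not the constructs actually used.

```python
# Map each DNA base to its complement (upper and lower case).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, ignoring spaces."""
    cleaned = seq.replace(" ", "")
    return cleaned.translate(COMPLEMENT)[::-1]


if __name__ == "__main__":
    # Illustrative sense-strand fragment written as codons.
    sense_fragment = "ATG GGG AGG GGG AGG"
    print(reverse_complement(sense_fragment))
```

Applying the function twice returns the original sequence, which is a quick sanity check on any fragment.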

TABLE 5

Constructs tested in Arabidopsis and the number of transformed
plants obtained.

    Promoter    Gene                   Number of plants
                                       obtained

1.  leafy       antisense leafy        52
2.  leafy       antisense AG-MADS      84
3.  35S         antisense leafy        30
4.  FLE2        antisense FLE2         50
Those skilled in the art will appreciate that the invention described herein is susceptible
to variations and modifications other than those specifically described. It is to be
understood that the invention includes all such variations and modifications. The
invention also includes all of the steps, features, compositions and compounds referred
to or indicated in this specification, individually or collectively, and any and all
combinations of any two or more of said steps or features.
(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1500 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 121..453

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:



CCACTACGTA CGTACATACA GTGTACATTG AATATACTAA ACAGATGGAC CCATGTGTTT 60
ATAAAACAAG GAAGCGAGAG TCCTGTACCC TTACACGCTG AAGCCATTTG AAGCGCGAGA 120

ATG GAT CCA GAA GCA TTT GCG GTT GTG GGG TTG CGA ACG ATG GGG GGA 168
Met Asp Pro Glu Ala Phe Ala Val Val Gly Leu Arg Thr Met Gly Gly
1 5 10 15

CTG GAG GAG CTG TTC GAG GCT TAT GGC ATA AGG TAC CTC ACG GCC TCC 216
Leu Glu Glu Leu Phe Glu Ala Tyr Gly Ile Arg Tyr Leu Thr Ala Ser
20 25 30

AGG ATA GCG GAA ATG GGG TTT ACG GCC AAC ACC CTC CTC GAC ATG AAG 264
Arg Ile Ala Glu Met Gly Phe Thr Ala Asn Thr Leu Leu Asp Met Lys
35 40 45

GAG GAG GAG CTC GAC GAC ATG ATG AAC TCC CTC TCC CAC ATC TTC CGC 312
Glu Glu Glu Leu Asp Asp Met Met Asn Ser Leu Ser His Ile Phe Arg
50 55 60

TGG GAC CTC CTC GTC GGC GAG CGC TAC GGC ATC AAG GCC GCC ATC CGC 360
Trp Asp Leu Leu Val Gly Glu Arg Tyr Gly Ile Lys Ala Ala Ile Arg
65 70 75 80

GCC GAG CGC CGA CGC CTC CTC GAA GCC GAT GAC CGC CGC CAC CAC CTC 408
Ala Glu Arg Arg Arg Leu Leu Glu Ala Asp Asp Arg Arg His His Leu
85 90 95

CAC TCC ACC GAC CAT GCC CTC CTC GAT GCT CTC TCC CAC CAA GGT 453
His Ser Thr Asp His Ala Leu Leu Asp Ala Leu Ser His Gln Gly
100 105 110

ACCTTAGCTC GGCCTCTTTG ATTTTTGACG TTTTGGGTCA TTATGATTAT TCCCACGCAG 513
GGCTGTCGGA GGAACAAGTG GTGCAGCACT CAGAGAAGGA TCAGCTGGGC AGGGCGGGAA 573
GCGGGGACAC GGCGGGCACG TCGTGGGGCG CCCAACAACA GAGAAAGAAG CATCGTCATC 633
GTCACCACAT CACCGCGATG AAAGGAGCGG CCACGGAAGA GGACGAGGAG GACGAGGAGG 693
AAGTGGAGGA GATGAGGAGG CAGAGGGAGC ACCCCTTCAT AGTGACGGAG CCCGGGGAGG 753
TGGCGCGTGG GAAGAAGAAC GGCCTGGACT ACCTCTTCCA TCTCTACGAC CAGTGCCGCG 813
ACTTCCTCCT CCAAGTCCAA TCCTTGGCCA AGGAGCGGGG CGAGAAATGC CCCACCAAGG 873
TCCTCCGCCT CTCTTCTTTT TATAGTCTTT CGATACAATC AAACTTCTTC GAACGTACCC 933
AAGCACGGAT CATTCAAACA AGAAGACAAT GCAGGTGACG AACCAGGTGT TCAGGTACGC 993
GAAGAAGGCG GGAGCAAGCT ACATAAACAA GCCGAAGATG AGGCACTATG TCCACTGCTA 1053
CGCCCTGCAC TGCCTGGACG AGCACGCCTC CAACGCCCTT CGCAAGAGCT TCAAGGAGCG 1113
CGGGGAGAAC GTCGGCGCCT GGAGGCAAGC CTGTTACCAC CCCCTGGTCA CCATCGCCGG 1173
CCGCAGGGCC GGCTGGGACA TCGACGCCAT CTTCAATGCC CACCCCCGCC TCTGCATCTG 1233
GTATGTCCCC ACCAAGCTCC GCCAGCTCTG CCATGCTCAC CGCCACTCCT CCGCCTCTGC 1293
TGCTTCCTCC GCTTCCACCT CCACCTCTGC CCCCACCGCT CACCATCTCG AACTCCCTTA 1353
CTAGTTCGTG CCCGTTCCTC ATTCTCTTGT GTTTTAGTGT CGTATCAGCG CATGATCATG 1413
AGGGAGATGA AAGTACTGTG CTGTGGTCTA TTTCTACTGC AATGAGTATT AATTGAATTT 1473
TAACGTGTAG CTGAGCTGAG CTGAGCT 1500



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 111 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Asp Pro Glu Ala Phe Ala Val Val Gly Leu Arg Thr Met Gly Gly
1 5 10 15

Leu Glu Glu Leu Phe Glu Ala Tyr Gly Ile Arg Tyr Leu Thr Ala Ser
20 25 30

Arg Ile Ala Glu Met Gly Phe Thr Ala Asn Thr Leu Leu Asp Met Lys
35 40 45

Glu Glu Glu Leu Asp Asp Met Met Asn Ser Leu Ser His Ile Phe Arg
50 55 60

Trp Asp Leu Leu Val Gly Glu Arg Tyr Gly Ile Lys Ala Ala Ile Arg
65 70 75 80

Ala Glu Arg Arg Arg Leu Leu Glu Ala Asp Asp Arg Arg His His Leu
85 90 95

His Ser Thr Asp His Ala Leu Leu Asp Ala Leu Ser His Gln Gly
100 105 110
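
The CDS feature location and the stated protein length of a sequence listing entry can be cross-checked mechanically: a CDS spanning positions 121..453 covers 333 nucleotides, which is 111 codons. The sketch below performs that check and translates a short fragment with the standard genetic code. The helper names `cds_length_in_codons` and `translate` are illustrative, and the fragment is the first seven codons of the coding region as they appear in the genomic sequence.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def cds_length_in_codons(start: int, end: int) -> int:
    """Number of codons in a CDS given 1-based inclusive start and end."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"CDS length {length} is not a multiple of 3")
    return length // 3


def translate(dna: str) -> str:
    """Translate DNA (spaces ignored) into one-letter amino acid codes."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(cds_length_in_codons(121, 453))          # codons in the CDS
    print(translate("ATG GAT CCA GAA GCA TTT GCG"))  # Met Asp Pro Glu Ala Phe Ala
```

The same check applied to the other entries is a quick way to spot a location or length that was damaged in transcription.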
(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1500 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 516..872

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:



CCACTACGTA CGTACATACA GTGTACATTG AATATACTAA ACAGATGGAC CCATGTGTTT 60
ATAAAACAAG GAAGCGAGAG TCCTGTACCC TTACACGCTG AAGCCATTTG AAGCGCGAGA 120
ATGGATCCAG AAGCATTTGC GGTTGTGGGG TTGCGAACGA TGGGGGGACT GGAGGAGCTG 180
TTCGAGGCTT ATGGCATAAG GTACCTCACG GCCTCCAGGA TAGCGGAAAT GGGGTTTACG 240
GCCAACACCC TCCTCGACAT GAAGGAGGAG GAGCTCGACG ACATGATGAA CTCCCTCTCC 300
CACATCTTCC GCTGGGACCT CCTCGTCGGC GAGCGCTACG GCATCAAGGC CGCCATCCGC 360
GCCGAGCGCC GACGCCTCCT CGAAGCCGAT GACCGCCGCC ACCACCTCCA CTCCACCGAC 420
CATGCCCTCC TCGATGCTCT CTCCCACCAA GGTACCTTAG CTCGGCCTCT TTGATTTTTG 480

ACGTTTTGGG TCATTATGAT TATTCCCACG CAGGG CTG TCG GAG GAA CAA GTG 533
Leu Ser Glu Glu Gln Val
1 5

GTG CAG CAC TCA GAG AAG GAT CAG CTG GGC AGG GCG GGA AGC GGG GAC 581
Val Gln His Ser Glu Lys Asp Gln Leu Gly Arg Ala Gly Ser Gly Asp
10 15 20

ACG GCG GGC ACG TCG TGG GGC GCC CAA CAA CAG AGA AAG AAG CAT CGT 629
Thr Ala Gly Thr Ser Trp Gly Ala Gln Gln Gln Arg Lys Lys His Arg
25 30 35

CAT CGT CAC CAC ATC ACC GCG ATG AAA GGA GCG GCC ACG GAA GAG GAC 677
His Arg His His Ile Thr Ala Met Lys Gly Ala Ala Thr Glu Glu Asp
40 45 50

GAG GAG GAC GAG GAG GAA GTG GAG GAG ATG AGG AGG CAG AGG GAG CAC 725
Glu Glu Asp Glu Glu Glu Val Glu Glu Met Arg Arg Gln Arg Glu His
55 60 65 70

CCC TTC ATA GTG ACG GAG CCC GGG GAG GTG GCG CGT GGG AAG AAG AAC 773
Pro Phe Ile Val Thr Glu Pro Gly Glu Val Ala Arg Gly Lys Lys Asn
75 80 85

GGC CTG GAC TAC CTC TTC CAT CTC TAC GAC CAG TGC CGC GAC TTC CTC 821
Gly Leu Asp Tyr Leu Phe His Leu Tyr Asp Gln Cys Arg Asp Phe Leu
90 95 100

CTC CAA GTC CAA TCC TTG GCC AAG GAG CGG GGC GAG AAA TGC CCC ACC 869
Leu Gln Val Gln Ser Leu Ala Lys Glu Arg Gly Glu Lys Cys Pro Thr
105 110 115

AAG GTCCTCCGCC TCTCTTCTTT TTATAGTCTT TCGATACAAT CAAACTTCTT 922
Lys

CGAACGTACC CAAGCACGGA TCATTCAAAC AAGAAGACAA TGCAGGTGAC GAACCAGGTG 982
TTCAGGTACG CGAAGAAGGC GGGAGCAAGC TACATAAACA AGCCGAAGAT GAGGCACTAT 1042
GTCCACTGCT ACGCCCTGCA CTGCCTGGAC GAGCACGCCT CCAACGCCCT TCGCAAGAGC 1102
TTCAAGGAGC GCGGGGAGAA CGTCGGCGCC TGGAGGCAAG CCTGTTACCA CCCCCTGGTC 1162
ACCATCGCCG GCCGCAGGGC CGGCTGGGAC ATCGACGCCA TCTTCAATGC CCACCCCCGC 1222
CTCTGCATCT GGTATGTCCC CACCAAGCTC CGCCAGCTCT GCCATGCTCA CCGCCACTCC 1282
TCCGCCTCTG CTGCTTCCTC CGCTTCCACC TCCACCTCTG CCCCCACCGC TCACCATCTC 1342
GAACTCCCTT ACTAGTTCGT GCCCGTTCCT CATTCTCTTG TGTTTTAGTG TCGTATCAGC 1402
GCATGATCAT GAGGGAGATG AAAGTACTGT GCTGTGGTCT ATTTCTACTG CAATGAGTAT 1462
TAATTGAATT TTAACGTGTA GCTGAGCTGA GCTGAGCT 1500

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 119 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Leu Ser Glu Glu Gln Val Val Gln His Ser Glu Lys Asp Gln Leu Gly
1 5 10 15

Arg Ala Gly Ser Gly Asp Thr Ala Gly Thr Ser Trp Gly Ala Gln Gln
20 25 30

Gln Arg Lys Lys His Arg His Arg His His Ile Thr Ala Met Lys Gly
35 40 45

Ala Ala Thr Glu Glu Asp Glu Glu Asp Glu Glu Glu Val Glu Glu Met
50 55 60

Arg Arg Gln Arg Glu His Pro Phe Ile Val Thr Glu Pro Gly Glu Val
65 70 75 80

Ala Arg Gly Lys Lys Asn Gly Leu Asp Tyr Leu Phe His Leu Tyr Asp
85 90 95

Gln Cys Arg Asp Phe Leu Leu Gln Val Gln Ser Leu Ala Lys Glu Arg
100 105 110

Gly Glu Lys Cys Pro Thr Lys
115
(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1500 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 968..1354

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

CCACTACGTA CGTACATACA GTGTACATTG AATATACTAA ACAGATGGAC CCATGTGTTT 60
ATAAAACAAG GAAGCGAGAG TCCTGTACCC TTACACGCTG AAGCCATTTG AAGCGCGAGA 120
ATGGATCCAG AAGCATTTGC GGTTGTGGGG TTGCGAACGA TGGGGGGACT GGAGGAGCTG 180
TTCGAGGCTT ATGGCATAAG GTACCTCACG GCCTCCAGGA TAGCGGAAAT GGGGTTTACG 240
GCCAACACCC TCCTCGACAT GAAGGAGGAG GAGCTCGACG ACATGATGAA CTCCCTCTCC 300
CACATCTTCC GCTGGGACCT CCTCGTCGGC GAGCGCTACG GCATCAAGGC CGCCATCCGC 360
GCCGAGCGCC GACGCCTCCT CGAAGCCGAT GACCGCCGCC ACCACCTCCA CTCCACCGAC 420
CATGCCCTCC TCGATGCTCT CTCCCACCAA GGTACCTTAG CTCGGCCTCT TTGATTTTTG 480
ACGTTTTGGG TCATTATGAT TATTCCCACG CAGGGCTGTC GGAGGAACAA GTGGTGCAGC 540
ACTCAGAGAA GGATCAGCTG GGCAGGGCGG GAAGCGGGGA CACGGCGGGC ACGTCGTGGG 600
GCGCCCAACA ACAGAGAAAG AAGCATCGTC ATCGTCACCA CATCACCGCG ATGAAAGGAG 660
CGGCCACGGA AGAGGACGAG GAGGACGAGG AGGAAGTGGA GGAGATGAGG AGGCAGAGGG 720
AGCACCCCTT CATAGTGACG GAGCCCGGGG AGGTGGCGCG TGGGAAGAAG AACGGCCTGG 780
ACTACCTCTT CCATCTCTAC GACCAGTGCC GCGACTTCCT CCTCCAAGTC CAATCCTTGG 840
CCAAGGAGCG GGGCGAGAAA TGCCCCACCA AGGTCCTCCG CCTCTCTTCT TTTTATAGTC 900
TTTCGATACA ATCAAACTTC TTCGAACGTA CCCAAGCACG GATCATTCAA ACAAGAAGAC 960

AATGCAG GTG ACG AAC CAG GTG TTC AGG TAC GCG AAG AAG GCG GGA GCA 1009
Val Thr Asn Gln Val Phe Arg Tyr Ala Lys Lys Ala Gly Ala
1 5 10

AGC TAC ATA AAC AAG CCG AAG ATG AGG CAC TAT GTC CAC TGC TAC GCC 1057
Ser Tyr Ile Asn Lys Pro Lys Met Arg His Tyr Val His Cys Tyr Ala
15 20 25 30

CTG CAC TGC CTG GAC GAG CAC GCC TCC AAC GCC CTT CGC AAG AGC TTC 1105
Leu His Cys Leu Asp Glu His Ala Ser Asn Ala Leu Arg Lys Ser Phe
35 40 45


AAG GAG CGC GGG GAG AAC GTC GGC GCC TGG AGG CAA GCC TGT TAC CAC 1153
Lys Glu Arg Gly Glu Asn Val Gly Ala Trp Arg Gln Ala Cys Tyr His
50 55 60

CCC CTG GTC ACC ATC GCC GGC CGC AGG GCC GGC TGG GAC ATC GAC GCC 1201
Pro Leu Val Thr Ile Ala Gly Arg Arg Ala Gly Trp Asp Ile Asp Ala
65 70 75

ATC TTC AAT GCC CAC CCC CGC CTC TGC ATC TGG TAT GTC CCC ACC AAG 1249
Ile Phe Asn Ala His Pro Arg Leu Cys Ile Trp Tyr Val Pro Thr Lys
80 85 90

CTC CGC CAG CTC TGC CAT GCT CAC CGC CAC TCC TCC GCC TCT GCT GCT 1297
Leu Arg Gln Leu Cys His Ala His Arg His Ser Ser Ala Ser Ala Ala
95 100 105 110

TCC TCC GCT TCC ACC TCC ACC TCT GCC CCC ACC GCT CAC CAT CTC GAA 1345
Ser Ser Ala Ser Thr Ser Thr Ser Ala Pro Thr Ala His His Leu Glu
115 120 125

CTC CCT TAC TAGTTCGTGC CCGTTCCTCA TTCTCTTGTG TTTTAGTGTC 1394
Leu Pro Tyr

GTATCAGCGC ATGATCATGA GGGAGATGAA AGTACTGTGC TGTGGTCTAT TTCTACTGCA 1454
ATGAGTATTA ATTGAATTTT AACGTGTAGC TGAGCTGAGC TGAGCT 1500

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 129 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Val Thr Asn Gln Val Phe Arg Tyr Ala Lys Lys Ala Gly Ala Ser Tyr
1 5 10 15

Ile Asn Lys Pro Lys Met Arg His Tyr Val His Cys Tyr Ala Leu His
20 25 30

Cys Leu Asp Glu His Ala Ser Asn Ala Leu Arg Lys Ser Phe Lys Glu
35 40 45

Arg Gly Glu Asn Val Gly Ala Trp Arg Gln Ala Cys Tyr His Pro Leu
50 55 60

Val Thr Ile Ala Gly Arg Arg Ala Gly Trp Asp Ile Asp Ala Ile Phe
65 70 75 80

Asn Ala His Pro Arg Leu Cys Ile Trp Tyr Val Pro Thr Lys Leu Arg
85 90 95

Gln Leu Cys His Ala His Arg His Ser Ser Ala Ser Ala Ala Ser Ser
100 105 110

Ala Ser Thr Ser Thr Ser Ala Pro Thr Ala His His Leu Glu Leu Pro
115 120 125

Tyr
(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1143 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 154..906

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:



CGGCCGCCTC TCTCTCTCTC TCTCTCTCTC TCTCTCTGGT TCAAGAACAC ACCAAAGAAG 60
AGAGACCCAT GTTAAGCTGT TAGTATCTGG AGTCTTATTC TTTTCATATT TTTCTCCAAA 120

GCATCCTCGT CCCCAAGGTT TTTTTGCGGC AGT ATG GAG TTC CCG AGT GAA TTT 174
Met Glu Phe Pro Ser Glu Phe
1 5

TCA GAG GCC TCT TCA CAG AAG AGA ATC GGG GGG AGA GGG AAA ATA GAG 222
Ser Glu Ala Ser Ser Gln Lys Arg Ile Gly Gly Arg Gly Lys Ile Glu
10 15 20

ATC AAA CGG ATC GAG AAC ACG ACG AAC CGG CAG GTC ACC TTT TGT AAA 270
Ile Lys Arg Ile Glu Asn Thr Thr Asn Arg Gln Val Thr Phe Cys Lys
25 30 35

CGC CGG AAC GGG TTG TTG AAG AAG GCT TAT GAG CTA TCG GTG TTG TGC 318
Arg Arg Asn Gly Leu Leu Lys Lys Ala Tyr Glu Leu Ser Val Leu Cys
40 45 50 55

GAT GCT GAA GTG GCG CTT ATT GTC TTC TCG AGC CGT GGC AGG CTC TAT 366
Asp Ala Glu Val Ala Leu Ile Val Phe Ser Ser Arg Gly Arg Leu Tyr
60 65 70

GAA TAT GCT AAC AAC AGT GTC AGA GGA ACA ATT GAG AGG TAC AAG AAA 414
Glu Tyr Ala Asn Asn Ser Val Arg Gly Thr Ile Glu Arg Tyr Lys Lys
75 80 85

GCA AGC AGT GAT TCC TCA CAT CCA CAG TCC GTT TCT GAA GTG AAC ACT 462
Ala Ser Ser Asp Ser Ser His Pro Gln Ser Val Ser Glu Val Asn Thr
90 95 100

CAG TTT TAT CAG CAA GAA GCA TCC AAG CTT CGG AGA CAG ATA AGA GAA 510
Gln Phe Tyr Gln Gln Glu Ala Ser Lys Leu Arg Arg Gln Ile Arg Glu
105 110 115

ATC CAG GTC TCA AAT AGG CAT ATT CTA GGT GAG GGT ATA AGT GAT TTG 558
Ile Gln Val Ser Asn Arg His Ile Leu Gly Glu Gly Ile Ser Asp Leu
120 125 130 135

AGC TTC AAG GAT CTC AAG AAT CTC GAG AGC AAA TTA GAG AAA TCG ATC 606
Ser Phe Lys Asp Leu Lys Asn Leu Glu Ser Lys Leu Glu Lys Ser Ile
140 145 150

AGC CGA GTT AGA TCA AAG AAG AAT GAG ATG CTT TTT GCC GAG ATT GAG 654
Ser Arg Val Arg Ser Lys Lys Asn Glu Met Leu Phe Ala Glu Ile Glu
155 160 165
155 160 165 
5 S K^lf/iSK-- 

170 175 
200 205 

CCC CTG CCG TCT CAG f " X. 

Ala Leu Pro Ser Gin Pro Ala Tyr Asp Arg 23Q 

220 " s 

GTC CTG GAA CCA AAT CAC CAA TCT ^TAT TCT CGA TCC : «C « iKTGCC^ 
val Leu Glu Pro Asn His Olr. Ser Tyr Ser Arg » v ^ 
235 240 



CTC CAA CTC GTC TAATTAAAGC TCAGTTAATA TGTAGCACTA GCAACTGGAC 946
Leu Gln Leu Val
250

GTATTGGCAC TCTAGATTAT GCTTTTTAAT ATGCTAGAGT TAGTTGAAAG AGGAATCAAT 1006
GAACTGGGGA GAGACCCGCA TGATGAGTTT CGCCTAAACA GTGGAAGAAC TCTCTGATAT 1066
TTCTCTTCAT CACTCTTCAA ACTCTTCTGA TGAATCGTGT TGTACAAGGG TTATTGCAAT 1126
TTTTP" TAG TACTTAA 1143



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 251 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:



Met Glu Phe Pro Ser Glu Phe Ser Glu Ala Ser Ser Gln Lys Arg Ile
1 5 10 15

Gly Gly Arg Gly Lys Ile Glu Ile Lys Arg Ile Glu Asn Thr Thr Asn
20 25 30

Arg Gln Val Thr Phe Cys Lys Arg Arg Asn Gly Leu Leu Lys Lys Ala
35 40 45

Tyr Glu Leu Ser Val Leu Cys Asp Ala Glu Val Ala Leu Ile Val Phe
50 55 60

Ser Ser Arg Gly Arg Leu Tyr Glu Tyr Ala Asn Asn Ser Val Arg Gly
65 70 75 80

Thr Ile Glu Arg Tyr Lys Lys Ala Ser Ser Asp Ser Ser His Pro Gln
85 90 95

Ser Val Ser Glu Val Asn Thr Gln Phe Tyr Gln Gln Glu Ala Ser Lys
100 105 110

Leu Arg Arg Gln Ile Arg Glu Ile Gln Val Ser Asn Arg His Ile Leu
115 120 125

Gly Glu Gly Ile Ser Asp Leu Ser Phe Lys Asp Leu Lys Asn Leu Glu
130 135 140

Ser Lys Leu Glu Lys Ser Ile Ser Arg Val Arg Ser Lys Lys Asn Glu
145 150 155 160

Met Leu Phe Ala Glu Ile Glu Tyr Met Gln Lys Arg Glu Ile Glu Leu
165 170 175

Gln Asn Asp Asn Met Tyr Leu Arg Ala Lys Ile Ala Glu Asn Glu Arg
180 185 190

Ala Gln Gln Gln Gln Gln Gln Gly Ser Asp His His Phe Asn Met Pro
195 200 205

Gly Ser Ser Ser Val Tyr Glu Ala Leu Pro Ser Gln Pro Ala Tyr Asp
210 215 220

Arg Asn Phe Leu Gln Val Asn Val Leu Glu Pro Asn His Gln Ser Tyr
225 230 235 240

Ser Arg Ser Asp His Thr Ala Leu Gln Leu Val
245 250



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1084 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 107..838

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

TAACGCCCCA TCTCTCTTCT CTCTCTCTCT CTCTCTGTGG CTTTGTCTTT TCTTTTGTTT 
CTTGCCGTTT TGGCGTGTGT GTTGGGTTGT GTGAATTGGA GCGAGG ATG GGG AGG • 

Met Gly Arg 

S5 5 2 31 2 

io 15 

£ £e £ £ £J A A^ G a ^™ "5 T - G CAC GAG ATC 

20 3 of Y GU Leu Lys h Y 3 His Glu lie 

30 u 



SO 
115 

163 

211 
TCC GTC CTC TGC GAC GCC GAG GTC GCC CTC ATC GTC TTC TCC GCC AAG 259
Ser Val Leu Cys Asp Ala Glu Val Ala Leu Ile Val Phe Ser Ala Lys
40 45 50

GGC AAG CTC TTC GAG TAC TCC ACC GAT TCC TGC ATG GAG AGA ATT CTC 307
Gly Lys Leu Phe Glu Tyr Ser Thr Asp Ser Cys Met Glu Arg Ile Leu
55 60 65

GAA CGC TAT GAA AGA TAC TCA TAT GCG GAG CAC CAA GTT CTT GCA AGC 355
Glu Arg Tyr Glu Arg Tyr Ser Tyr Ala Glu His Gln Val Leu Ala Ser
70 75 80

GAG ACG GAA TCG ATT GGT AGC TGG ACT TTG GAG CAT GCT AAG CTC AAG 403
Glu Thr Glu Ser Ile Gly Ser Trp Thr Leu Glu His Ala Lys Leu Lys
85 90 95

GCC AGA CTT GAA GTT TTA CAC AGA AAT TAT AGG CAT TTC ATG GGC GAA 451
Ala Arg Leu Glu Val Leu His Arg Asn Tyr Arg His Phe Met Gly Glu
100 105 110 115

GAT CTT GAT TCT TTG AGT CTC AAG GAC CTC CAA AAT TTG GAG CAG CAA 499
Asp Leu Asp Ser Leu Ser Leu Lys Asp Leu Gln Asn Leu Glu Gln Gln
120 125 130

CTG GAG TCT GCT CTT AAA CAC ATA AGA TCG AGA AAG AAT CAG CTC ATG 547
Leu Glu Ser Ala Leu Lys His Ile Arg Ser Arg Lys Asn Gln Leu Met
135 140 145

CAT GAA TCA ATC TCA GTG ATT CAG AAA AAG GAT AGG GCA TTG CAG GAG 595
His Glu Ser Ile Ser Val Ile Gln Lys Lys Asp Arg Ala Leu Gln Glu
150 155 160

CAA AAT AAC CTG CTT ACA AAG AAA ATA AAG GAG AAG GAG AGG GCA CTA 643
Gln Asn Asn Leu Leu Thr Lys Lys Ile Lys Glu Lys Glu Arg Ala Leu
165 170 175

GCA CAG CAA GCT CAG TGG GAG CAG CAA GAC CAT GCC CTT GAC TCA CCT 691
Ala Gln Gln Ala Gln Trp Glu Gln Gln Asp His Ala Leu Asp Ser Pro
180 185 190 195

GTT GTT CTA CCC CAC TAC TTG CCA TCT CTC GAC ATC AAT GGC TCT TAT 739
Val Val Leu Pro His Tyr Leu Pro Ser Leu Asp Ile Asn Gly Ser Tyr
200 205 210

CAA GCG AGA CAC AAC GGA CAC GAT GAC GGA GAG AAC CTG ACT CAG CCT 787
Gln Ala Arg His Asn Gly His Asp Asp Gly Glu Asn Leu Thr Gln Pro
215 220 225

CGG GCT GGT ACA CTT CTT CCT CCG TGG ATC TCT GCA TCT CAA TTA AGG 835
Arg Ala Gly Thr Leu Leu Pro Pro Trp Ile Ser Ala Ser Gln Leu Arg
230 235 240

CTT TGACATAAAA GGGAACTTCT GCTGCTATAT CTTTGATAAG CAATTCTGAA 888
Leu

GTCAAGTGAT GAATTGCATG GCATAATCTC AGGCTCTCCC CCACCCTTGT TCTAGCGAAA 948
TAAACATGTC CCTGCTTTTT GCTGCATTCA GTACTCCTAA GTGTGGTCCT CCGTGTATGT 1008
ATAGCACCAT TGCAAATATG CATACCAATG TACATGTATG CAAAATTATA TATGGAAGTT 1068
AACACAACCA TAAAAA 1084
(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 244 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Met Gly Arg Gly Arg Leu Gln Leu Lys Arg Ile Glu Asn Lys Ile Asn
1 5 10 15

Arg Gln Val Thr Phe Ser Lys Arg Arg Ala Gly Leu Leu Lys Lys Ala
20 25 30

His Glu Ile Ser Val Leu Cys Asp Ala Glu Val Ala Leu Ile Val Phe
35 40 45

Ser Ala Lys Gly Lys Leu Phe Glu Tyr Ser Thr Asp Ser Cys Met Glu
50 55 60

Arg Ile Leu Glu Arg Tyr Glu Arg Tyr Ser Tyr Ala Glu His Gln Val
65 70 75 80

Leu Ala Ser Glu Thr Glu Ser Ile Gly Ser Trp Thr Leu Glu His Ala
85 90 95

Lys Leu Lys Ala Arg Leu Glu Val Leu His Arg Asn Tyr Arg His Phe
100 105 110

Met Gly Glu Asp Leu Asp Ser Leu Ser Leu Lys Asp Leu Gln Asn Leu
115 120 125

Glu Gln Gln Leu Glu Ser Ala Leu Lys His Ile Arg Ser Arg Lys Asn
130 135 140

Gln Leu Met His Glu Ser Ile Ser Val Ile Gln Lys Lys Asp Arg Ala
145 150 155 160

Leu Gln Glu Gln Asn Asn Leu Leu Thr Lys Lys Ile Lys Glu Lys Glu
165 170 175

Arg Ala Leu Ala Gln Gln Ala Gln Trp Glu Gln Gln Asp His Ala Leu
180 185 190

Asp Ser Pro Val Val Leu Pro His Tyr Leu Pro Ser Leu Asp Ile Asn
195 200 205

Gly Ser Tyr Gln Ala Arg His Asn Gly His Asp Asp Gly Glu Asn Leu
210 215 220

Thr Gln Pro Arg Ala Gly Thr Leu Leu Pro Pro Trp Ile Ser Ala Ser
225 230 235 240

Gln Leu Arg Leu
(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1032 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 184..799

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

TCGAACAGTT CAGAAAGACG GATAGATACA TACACAGAGA CATCGATTGC ATCATCTTCT 60
CTACCTCATT GTGCGTGCGG TGCGTGCGGT GCGTGCTTTA TATAGACGCT CGCTCTTACG 120
TTGTTCTCGT CCGCCCGGTG GTGTGGTGGT TGGAGAGAGA GGGAGAGAGA GTAGAGAGAG 180

AAA ATG GGG AGG GGG AGG GTG CAG CTG AAG AGG ATA GAG AAC AAG ATA 228
Met Gly Arg Gly Arg Val Gln Leu Lys Arg Ile Glu Asn Lys Ile
1 5 10 15

AAC AGG CAA GTG ACG TTC TCG AAG CGG AGA TGC GGG CTG CTG AAG AAG 276
Asn Arg Gln Val Thr Phe Ser Lys Arg Arg Cys Gly Leu Leu Lys Lys
20 25 30

GCC CAC GAG ATC TCC GTC CTC TGC GAC GCC GAC GTC GCC CTC ATC GTC 324
Ala His Glu Ile Ser Val Leu Cys Asp Ala Asp Val Ala Leu Ile Val
35 40 45

TTC TCC ACC AAG GGC AAG CTC TTC GAG TAC GCC ACC GAC TGT TGC ATG 372
Phe Ser Thr Lys Gly Lys Leu Phe Glu Tyr Ala Thr Asp Cys Cys Met
50 55 60

GAG AGG ATC CTC GAG CGT TAT GAG AGA TAT TCA TAT GCA GAG AGC CAG 420
Glu Arg Ile Leu Glu Arg Tyr Glu Arg Tyr Ser Tyr Ala Glu Ser Gln
65 70 75

GTT CTC ACA AAC AAT GCC GAA ACC AAT GGG AAC TGG ACT TTG GAA CAT 468
Val Leu Thr Asn Asn Ala Glu Thr Asn Gly Asn Trp Thr Leu Glu His
80 85 90 95

GCA AAA CTC AAG GCC AGG ATG GAG ATT CTG CAG AAA AAT CAA AAG AAT 516
Ala Lys Leu Lys Ala Arg Met Glu Ile Leu Gln Lys Asn Gln Lys Asn
100 105 110

CTG ATG GGA GAA GAA CTC GAT TCT CTA AGC CTC AAA GAG CTT CAG AAT 564
Leu Met Gly Glu Glu Leu Asp Ser Leu Ser Leu Lys Glu Leu Gln Asn
115 120 125

TTG GAG CAT CAG CTT GAC ACT GCT CTT AAG AAC ATT CGG TCT AGA AAG 612
Leu Glu His Gln Leu Asp Thr Ala Leu Lys Asn Ile Arg Ser Arg Lys
130 135 140

ATT CAA CTC ATG TGT GAA TCC ATC TCT GAG CTT CAG AGA AAG GAT AAG 660
Ile Gln Leu Met Cys Glu Ser Ile Ser Glu Leu Gln Arg Lys Asp Lys
145 150 155

GCG CTG CAA GAA CAA AAC AAC ATG CTG GCA AAG AAG GTG AAA GAG AAG 708
Ala Leu Gln Glu Gln Asn Asn Met Leu Ala Lys Lys Val Lys Glu Lys
160 165 170 175

GAG AAG GCA CTG GCA CAG CAA ACT CAA TGG GAC AAC CCG CAA GAT GAT 756
Glu Lys Ala Leu Ala Gln Gln Thr Gln Trp Asp Asn Pro Gln Asp Asp
180 185 190

GGC CTC ACC TCT TCC TCG GGG TCC TTA CCA TCC GAG TGG AAT T 799
Gly Leu Thr Ser Ser Ser Gly Ser Leu Pro Ser Glu Trp Asn
195 200 205

GAGGAGGGAG CGGCCCTCGG TCCACCTCAA CATCGTAATG CCACCCTTTT TCCTTCATGG 859
ATGCTTAGCC ACCTCCAAGC CTAATTGTCC AATTTATTTG CATGGGAACC ATATATCATT 919
ACCAATATTA TCTAATGCTT CTTATCGTAT ATATCCAGTC GATATGTAAC CATAATGTAA 979
CATGTCTTAG TACATGCATT GTGGGGATAC TTGCAAAAAC CTTGTTTTGT AGG 1032



(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 205 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Met Gly Arg Gly Arg Val Gln Leu Lys Arg Ile Glu Asn Lys Ile Asn
1 5 10 15

Arg Gln Val Thr Phe Ser Lys Arg Arg Cys Gly Leu Leu Lys Lys Ala
20 25 30

His Glu Ile Ser Val Leu Cys Asp Ala Asp Val Ala Leu Ile Val Phe
35 40 45

Ser Thr Lys Gly Lys Leu Phe Glu Tyr Ala Thr Asp Cys Cys Met Glu
50 55 60

Arg Ile Leu Glu Arg Tyr Glu Arg Tyr Ser Tyr Ala Glu Ser Gln Val
65 70 75 80

Leu Thr Asn Asn Ala Glu Thr Asn Gly Asn Trp Thr Leu Glu His Ala
85 90 95

Lys Leu Lys Ala Arg Met Glu Ile Leu Gln Lys Asn Gln Lys Asn Leu
100 105 110

Met Gly Glu Glu Leu Asp Ser Leu Ser Leu Lys Glu Leu Gln Asn Leu
115 120 125

Glu His Gln Leu Asp Thr Ala Leu Lys Asn Ile Arg Ser Arg Lys Ile
130 135 140

Gln Leu Met Cys Glu Ser Ile Ser Glu Leu Gln Arg Lys Asp Lys Ala
145 150 155 160

Leu Gln Glu Gln Asn Asn Met Leu Ala Lys Lys Val Lys Glu Lys Glu
165 170 175

Lys Ala Leu Ala Gln Gln Thr Gln Trp Asp Asn Pro Gln Asp Asp Gly
180 185 190

Leu Thr Ser Ser Ser Gly Ser Leu Pro Ser Glu Trp Asn
195 200 205
(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1070 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 184..920

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

TCGAACAGTT CAGAAAGACG GATAGATACA TACACAGAGA CATCGATTGC ATCATCTTCT 60
CTACCTCATT GTGCGTGCGG TGCGTGCGGT GCGTGCTTTA TATAGACGCT CGCTCTTACG 120
TTGTTCTCGT CCGCCCGGTG GTGTGGTGGT TGGAGAGAGA GGGAGAGAGA GTAGAGAGAG 180

AAA ATG GGG AGG GGG AGG GTG CAG CTG AAG AGG ATA GAG AAC AAG ATA 228
Met Gly Arg Gly Arg Val Gln Leu Lys Arg Ile Glu Asn Lys Ile
1 5 10 15

AAC AGG CAA GTG ACG TTC TCG AAG CGG AGA TGC GGG CTG CTG AAG AAG 276
Asn Arg Gln Val Thr Phe Ser Lys Arg Arg Cys Gly Leu Leu Lys Lys
20 25 30

GCC CAC GAG ATC TCC GTC CTC TGC GAC GCC GAC GTC GCC CTC ATC GTC 324
Ala His Glu Ile Ser Val Leu Cys Asp Ala Asp Val Ala Leu Ile Val
35 40 45

TTC TCC ACC AAG GGC AAG CTC TTC GAG TAC GCC ACC GAC TGT TGC ATG 372
Phe Ser Thr Lys Gly Lys Leu Phe Glu Tyr Ala Thr Asp Cys Cys Met
50 55 60

GAG AGG ATC CTC GAG CGT TAT GAG AGA TAT TCA TAT GCA GAG AGC CAG 420
Glu Arg Ile Leu Glu Arg Tyr Glu Arg Tyr Ser Tyr Ala Glu Ser Gln
65 70 75

GTT CTC ACA AAC AAT GCC GAA ACC AAT GGG AAC TGG ACT TTG GAA CAT 468
Val Leu Thr Asn Asn Ala Glu Thr Asn Gly Asn Trp Thr Leu Glu His
80 85 90 95

GCA AAA CTC AAG GCC AGG ATG GAG ATT CTG CAG AAA AAT CAA AAG AAT 516
Ala Lys Leu Lys Ala Arg Met Glu Ile Leu Gln Lys Asn Gln Lys Asn
100 105 110

CTG ATG GGA GAA GAA CTC GAT TCT CTA AGC CTC AAA GAG CTT CAG AAT 564
Leu Met Gly Glu Glu Leu Asp Ser Leu Ser Leu Lys Glu Leu Gln Asn
115 120 125

TTG GAG CAT CAG CTT GAC ACT GCT CTT AAG AAC ATT CGG TCT AGA AAG 612
Leu Glu His Gln Leu Asp Thr Ala Leu Lys Asn Ile Arg Ser Arg Lys
130 135 140

ATT CAA CTC ATG TGT GAA TCC ATC TCT GAG CTT CAG AGA AAG GAT AAG 660
Ile Gln Leu Met Cys Glu Ser Ile Ser Glu Leu Gln Arg Lys Asp Lys
145 150 155

GCG CTG CAA GAA CAA AAC AAC ATG CTG GCA AAG AAG GTG AAA GAG AAG 708
Ala Leu Gln Glu Gln Asn Asn Met Leu Ala Lys Lys Val Lys Glu Lys
160 165 170 175
S? 2S S E S S ^ ^iW^T^k 



200 205 



K S SJ Sf y S ^ S %Wff.-S?.«g„-s y -s«. 

215 220 

25S S 35 X 3S ^f/SA" ccr ra TO STC 

225 230 Phe Pro Ser Tr P Met 



^ Ser SE feu ^ n cfu TA ATGATCC ^ TTATTTGCAT GGGAACCATA 



240 245 



TATCATTACC AATATTATCT AATGCTTCTT ATCGTATATA TCCAGTCGAT ATGTAACCAT 1010
AATCTAACAT GTCTTAGTAC ATCCATTGTG GGGATACTTC CAAAAACCTT GTTT^TACC 1070

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 245 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Met Gly Arg Gly Arg Val Gln Leu Lys Arg Ile Glu Asn Lys Ile Asn
1 5 10 15

Arg Gln Val Thr Phe Ser Lys Arg Arg Cys Gly Leu Leu Lys Lys Ala
20 25 30

His Glu Ile Ser Val Leu Cys Asp Ala Asp Val Ala Leu Ile Val Phe
35 40 45

Ser Thr Lys Gly Lys Leu Phe Glu Tyr Ala Thr Asp Cys Cys Met Glu
50 55 60

Arg Ile Leu Glu Arg Tyr Glu Arg Tyr Ser Tyr Ala Glu Ser Gln Val
65 70 75 80

Leu Thr Asn Asn Ala Glu Thr Asn Gly Asn Trp Thr Leu Glu His Ala
85 90 95

Lys Leu Lys Ala Arg Met Glu Ile Leu Gln Lys Asn Gln Lys Asn Leu
100 105 110

Met Gly Glu Glu Leu Asp Ser Leu Ser Leu Lys Glu Leu Gln Asn Leu
115 120 125

Glu His Gln Leu Asp Thr Ala Leu Lys Asn Ile Arg Ser Arg Lys Ile
130 135 140

Gln Leu Met Cys Glu Ser Ile Ser Glu Leu Gln Arg Lys Asp Lys Ala
145 150 155 160

Leu Gln Glu Gln Asn Asn Met Leu Ala Lys Lys Val Lys Glu Lys Glu
165 170 175

Lys Ala Leu Ala Gln Gln Thr Gln Trp Asp Asn Pro Gln Asp Asp Gly
180 185 190

Leu Thr Ser Ser Ser Val Ile Leu Ser Gln Ser Leu Gln Pro Leu Asn
195 200 205

Ile Gly Gly Pro Tyr His Pro Ser Gly Ile Glu Glu Gly Ala Ala Leu
210 215 220

Gly Pro Pro Gln His Arg Asn Ala Thr Leu Phe Pro Ser Trp Met Leu
225 230 235 240

Ser His Leu Gln Glu
245

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 997 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 38..724

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

GCTTCCATCT CCAGGGAAGA GTTTTGAACT TGCGACC ATG GTT TTT CCA ACC CAA 5 5 

Met Val Phe Pro Thr Gin 
1 5 

GCC ACG CCC GAG GAG TCC CCG CAG AGG AAA ATG GGG AGG GGA AAG ATC 103 
Ala Thr Pro Glu Glu Ser Pro Gin Arg Lys Met Gly Arg Gly Lys He 
10 15 20 

GAG ATC AAG CGG ATC GAG AAC ACG ACG AAT CGG CAA GTG ACT TTC TGC 151 
Glu He Lys Arg He Glu Asn Thr Thr Asn Arg Gin Val Thr Phe Cys 
25 30 35 

AAG CGG CGG AAT GGC CTC CTC AAG AAG GCA TAT GAA CTC TCC GTT CTT 199 
Lys Arg Arg Asn Gly Leu Leu Lys Lys Ala Tyr Glu Leu Ser Val Leu 
40 45 " 50 

TGC GAA CGC GAG GTC GCC CTC ATC GTC TTC TCC AGC CGC GGT CGC CTC 24 7 

Cys Glu Arg Glu Val Ala :,eu He Val Phe Ser Ser Arg Gly Arg Leu 
55 60 65 70 

TAT GAG TAT GCC AAC GAT AGT GTC AAA GCA ACC ATC GAG AGG TAC AAG 2 95 

T, . Glu Tyr Ala Asn Asp Ser Val Lys Ala Thr He Glu Arg Tyr Lys 

75 80 85 

AAG GCT TTC TCA GAT TCC TCC AGT AGC GGG ATC CGT TCT GAA GCT AAT 343 
Lys Ma Phe Ser Asp Ser Ser Ser Ser Gly He Arg Ser Glu Ala Asn 
90 95 100 

GTT CAG TTT TAT CAG CAA GAA TCC GCA AAG TTG CAA CAA CAG ATT AAT 391 
Val Gin Phe Tyr Gin Gin Glu Ser Ala Lys Leu Gin Gin Gin He Asn 
105 110 us 
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AAC ATG CAG AAC AAT AAC AGG CAA CTG GTG GGT GAC TCA ATT GCT GGG 43 9 

Asn Met Gin Asn Asn Asn Arg Gin Leu Val Gly Asp Ser lie Ala Gly 
120 125 130 

ATG AAT ATG AAG GAT ATG AAG ACT ACG GAG CAA AAA CTA GAA AAA GCA 487 
Met Asn Met Lys Asp Met Lys Thr Thr Glu Gin Lys Leu Glu Lys Ala 
135 140 145 150 

ATC GCT AAA ATT CGC GCC AAA AAG AAT GAG CTT CTA TTT GCG GAG ATT 53 5 

lie Ala Lys He Arg Ala Lys Lys Asn Glu Leu Leu Phe Ala Glu lie 

155 160 165 

GAG TAT ATG CAA AAA AGG GAA ATT GAC CTA CAC AAC AAC AAT CAA GTT 583 
Glu Tyr Met Gin Lys Arg Glu He Asp Leu His Asn Asn Asn Gin Val 
170 175 180 

CTG CGA GCG AAG ATC GCG GAG AGC GAG AGG TCC CAA CAT GCG GAT ATG 631 
Leu Arg Ala Lys He Ala Glu Ser Glu Arg Ser Gin His Ala Asp Met 
185 190 195 

AAT TTG ATG CCG GGA GGT ACC AAC TAT GAC TTC ATG CAG CCA TCT TCT 679 
Asn Leu Met Pro Gly Gly Thr Asn Tyr Asp Phe Met Glr Pro Ser Ser 
200 205 210 

TCT CAG CCT TTT GAT TCC AGA AAC TAC TTC CAA GTG AAC GTG CTG 724 
Ser Gin Pro Phe Asp Ser Arg Asn T*^r Phe Gin Val Asn Val Leu"*-* • 
215 220 * 225 

TAGCACCCAA CCACTGTGAG TGTGCACAAG TCTAGTATCG ATTGAAGGGA ATATAACTAG 784 

TATATGGAGC TTCTAAGTGC TTATTATGAG GTTATGATGG ACTTCATCAG ATGTTAATTA 844 

AGTAGAAGCA GTTGGCAGAT GCATCTATTA GGTGAAGGGC GTCCTCATCC TTTTGTTGAT 904 

TAGCATCCTT AGATGTCTTA AAATTTCCAC CATCTTTGTT AAGGTGTGTA TTTGGGTCCC 964 

TATGTACTAA ACTGAATATG AACTATCGAA CTA 997 
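
The pairing of codon rows with amino acid rows in the listing above follows the standard genetic code. As a minimal sketch only (the function name `translate` and the use of the standard nuclear code table are assumptions for illustration, not part of the specification), the deduced protein can be re-derived from a coding region as follows:

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence into one-letter amino acids.

    Whitespace is ignored, translation stops at the first stop codon,
    and any codon containing a non-ACGT character raises KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the open reading frame: Met Val Phe Pro Thr Gln
print(translate("ATG GTT TTT CCA ACC CAA"))
# Last three codons followed by the TAG stop: Asn Val Leu
print(translate("AAC GTG CTG TAG"))
```

Running the full coding region through such a function is one way to check an extracted listing for transcription errors, since any codon row that disagrees with its amino acid row will stand out.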



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 229 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



Met Val Phe Pro Thr Gln Ala Thr Pro Glu Glu Ser Pro Gln Arg Lys 
1 5 10 15 

Met Gly Arg Gly Lys Ile Glu Ile Lys Arg Ile Glu Asn Thr Thr Asn 
20 25 30 

Arg Gln Val Thr Phe Cys Lys Arg Arg Asn Gly Leu Leu Lys Lys Ala 
35 40 45 

Tyr Glu Leu Ser Val Leu Cys Glu Arg Glu Val Ala Leu Ile Val Phe 
50 55 60 

Ser Ser Arg Gly Arg Leu Tyr Glu Tyr Ala Asn Asp Ser Val Lys Ala 
65 70 75 80 

Thr Ile Glu Arg Tyr Lys Lys Ala Phe Ser Asp Ser Ser Ser Ser Gly 
85 90 95 

Ile Arg Ser Glu Ala Asn Val Gln Phe Tyr Gln Gln Glu Ser Ala Lys 
100 105 110 

Leu Gln Gln Gln Ile Asn Asn Met Gln Asn Asn Asn Arg Gln Leu Val 
115 120 125 

Gly Asp Ser Ile Ala Gly Met Asn Met Lys Asp Met Lys Thr Thr Glu 
130 135 140 

Gln Lys Leu Glu Lys Ala Ile Ala Lys Ile Arg Ala Lys Lys Asn Glu 
145 150 155 160 

Leu Leu Phe Ala Glu Ile Glu Tyr Met Gln Lys Arg Glu Ile Asp Leu 
165 170 175 

His Asn Asn Asn Gln Val Leu Arg Ala Lys Ile Ala Glu Ser Glu Arg 
180 185 190 

Ser Gln His Ala Asp Met Asn Leu Met Pro Gly Gly Thr Asn Tyr Asp 
195 200 205 

Phe Met Gln Pro Ser Ser Ser Gln Pro Phe Asp Ser Arg Asn Tyr Phe 
210 215 220 

Gln Val Asn Val Leu 
225 

THE CLAIMS DEFINING THE INVENTION ARE AS FOLLOWS: 

1. An isolated nucleic acid molecule comprising a sequence of nucleotides encoding 
or complementary to a sequence encoding a product or derivative thereof required for 
normal flower development in a woody perennial. 

2. An isolated nucleic acid molecule according to claim 1 wherein the woody 
perennial is a species of the genus Eucalyptus. 

3. An isolated nucleic acid molecule according to claim 1 or 2 wherein the species 
of Eucalyptus is selected from Eucalyptus alpina, Eucalyptus annulata, Eucalyptus 
astringens, Eucalyptus behriana, Eucalyptus benthamii, Eucalyptus botryoides, 
Eucalyptus brockwayi, Eucalyptus caesia, Eucalyptus calophylla 'Rosea', Eucalyptus 
calycogona, Eucalyptus camaldulensis, Eucalyptus camaldulensis ssp. subcinerea, 
Eucalyptus campaspe, Eucalyptus camphora, Eucalyptus citriodora, Eucalyptus 
cladocalyx, Eucalyptus cladocalyx 'Nana', Eucalyptus cloeziana, Eucalyptus cornuta, 
Eucalyptus cosmophylla, Eucalyptus crenulata, Eucalyptus crucis, Eucalyptus 
delegatensis, Eucalyptus desmondensis, Eucalyptus dielsii, Eucalyptus diptera, 
Eucalyptus diversicolor, Eucalyptus diversifolia, Eucalyptus dumosa, Eucalyptus 
dundasii, Eucalyptus dunnii, Eucalyptus eremophila, Eucalyptus erythrocorys, Eucalyptus 
erythronema, Eucalyptus ficifolia, Eucalyptus flocktoniae, Eucalyptus foecunda, 
Eucalyptus forrestiana, Eucalyptus gardneri, Eucalyptus gillii, Eucalyptus globulus, 
Eucalyptus globulus ssp globulus, Eucalyptus globulus ssp. pseudoglobulus, Eucalyptus 
globulus ssp maidenii, Eucalyptus globulus ssp bicostata, Eucalyptus gomphocephala, 
Eucalyptus gracilis, Eucalyptus grandis, Eucalyptus griffithsii, Eucalyptus grossa, 
Eucalyptus henrii, Eucalyptus increassata, Eucalyptus kitsoniana, Eucalyptus 
kondininensis, Eucalyptus kruseana, Eucalyptus lansdowneana, Eucalyptus largiflorens, 
Eucalyptus lehmanni, Eucalyptus leucoxylon, Eucalyptus leucoxylon (Pink) (syn. 
'Rosea'), Eucalyptus lesouefii, Eucalyptus macrandra, Eucalyptus macrocarpa, 
Eucalyptus maculata, Eucalyptus marginata, Eucalyptus melliodora, Eucalyptus 
microcarpa, Eucalyptus nicholii, Eucalyptus nitens, Eucalyptus nutans, Eucalyptus 
obliqua, Eucalyptus occidentalis, Eucalyptus ochrophloia, Eucalyptus oleosa, Eucalyptus 
orbifolia, Eucalyptus ovata, Eucalyptus pauciflora, Eucalyptus parvifolia, Eucalyptus 
perriniana, Eucalyptus pileata, Eucalyptus pellita, Eucalyptus pilularis, Eucalyptus 
platypus, Eucalyptus polyanthemos, Eucalyptus porosa, Eucalyptus preissiana, 
Eucalyptus pyriformis, Eucalyptus regnans, Eucalyptus resinifera, Eucalyptus rhodantha, 
Eucalyptus robusta, Eucalyptus saligna, Eucalyptus salmonophloia, Eucalyptus salubris, 
Eucalyptus sargentii, Eucalyptus sepulcralis, Eucalyptus sideroxylon 'Rosea', Eucalyptus 
sieberi, Eucalyptus spathulata, Eucalyptus socialis, Eucalyptus steedmanii, Eucalyptus 
stoatei, Eucalyptus stricklandii, Eucalyptus tereticornis, Eucalyptus tetragona, 
Eucalyptus tetraptera, Eucalyptus torquata, Eucalyptus 'Torwood', Eucalyptus urnigera, 
Eucalyptus urophylla, Eucalyptus viminalis, Eucalyptus viridis, Eucalyptus wandoo, 
Eucalyptus websteriana, Eucalyptus woodwardii. 

4. An isolated nucleic acid molecule according to any one of claims 1 to 3 wherein 
the species of Eucalyptus is Eucalyptus benthamii, Eucalyptus botryoides, Eucalyptus 
camaldulensis, Eucalyptus citriodora, Eucalyptus cloeziana, Eucalyptus delegatensis, 
Eucalyptus diversicolor, Eucalyptus dunnii, Eucalyptus globulus, Eucalyptus globulus 
subspecies bicostata, Eucalyptus globulus subspecies globulus, Eucalyptus globulus 
subspecies maidenii, Eucalyptus globulus subspecies pseudoglobulus, Eucalyptus grandis, 
Eucalyptus henrii, Eucalyptus maculata, Eucalyptus marginata, Eucalyptus nitens, 
Eucalyptus obliqua, Eucalyptus pellita, Eucalyptus pilularis, Eucalyptus regnans, 
Eucalyptus resinifera, Eucalyptus saligna, Eucalyptus sieberi, Eucalyptus tereticornis, 
Eucalyptus urophylla, Eucalyptus viminalis. 

5. An isolated nucleic acid molecule according to any one of claims 1 to 4 wherein 
the species of Eucalyptus is Eucalyptus globulus ssp globulus or Eucalyptus nitens. 

6. An isolated nucleic acid molecule according to any one of claims 1 to 5 wherein 
the nucleic acid is DNA. 

7. An isolated nucleic acid molecule according to any one of claims 1 to 6 wherein 
the nucleotide sequence corresponds to a gene selected from leafy (LFY), apetala 1 
(AP1) or agamous (AG) from Arabidopsis or floricaula (FLO), squamosa (SQUA) or 
plena (PLE) from Antirrhinum. 

8. An isolated nucleic acid molecule according to any one of claims 1 to 7 wherein 
the nucleic acid molecule is selected from the following: 

(i) comprises a nucleotide sequence set forth in SEQ ID NO:1 or 3 or 5 and is 
analogous to LFY and/or FLO; 

(ii) comprises a nucleotide sequence set forth in SEQ ID NO:9 or 11 or 13 and is 
analogous to AP1 and/or SQUA; or 

(iii) comprises a nucleotide sequence set forth in SEQ ID NO:7 or 15 and is 
analogous to AG and/or PLE. 



9. An isolated DNA molecule of eucalypt origin encoding a reproductive gene and 
which is capable of hybridising under low stringency conditions to a nucleic acid 
molecule having a nucleotide sequence as set forth in or complementary to SEQ ID 
NO:1 or 3 or 5. 

10. An isolated DNA molecule of eucalypt origin encoding a reproductive gene and 
which is capable of hybridising under low stringency conditions to a nucleic acid 
molecule having a nucleotide sequence as set forth in or complementary to SEQ ID 
NO:9 or 11 or 13. 

11. An isolated DNA molecule of eucalypt origin encoding a reproductive gene and 
which is capable of hybridising under low stringency conditions to a nucleic acid 
molecule having a nucleotide sequence as set forth in or complementary to SEQ ID 
NO:7 or 15. 

12. A method of minimising inflorescence in a woody perennial, said method 
comprising introducing into one or more cells of said woody perennial a genetic construct 
capable of down-regulating or up-regulating expression of a reproductive gene in said 
woody perennial, regenerating a transgenic plant from said one or more cells and 
growing said plant for a time and under conditions sufficient to permit expression of said 
genetic construct. 

13. A method according to claim 12 wherein the woody perennial is a species of the 
genus Eucalyptus. 

14. A method according to claim 13 wherein the species of Eucalyptus is selected 
from Eucalyptus alpina, Eucalyptus annulata, Eucalyptus astringens, Eucalyptus 
behriana, Eucalyptus benthamii, Eucalyptus botryoides, Eucalyptus brockwayi, 
Eucalyptus caesia, Eucalyptus calophylla 'Rosea', Eucalyptus calycogona, Eucalyptus 
camaldulensis, Eucalyptus camaldulensis ssp. subcinerea, Eucalyptus campaspe, 
Eucalyptus camphora, Eucalyptus citriodora, Eucalyptus cladocalyx, Eucalyptus 
cladocalyx 'Nana', Eucalyptus cloeziana, Eucalyptus cornuta, Eucalyptus cosmophylla, 
Eucalyptus crenulata, Eucalyptus crucis, Eucalyptus delegatensis, Eucalyptus 
desmondensis, Eucalyptus dielsii, Eucalyptus diptera, Eucalyptus diversicolor, Eucalyptus 
diversifolia, Eucalyptus dumosa, Eucalyptus dundasii, Eucalyptus dunnii, Eucalyptus 
eremophila, Eucalyptus erythrocorys, Eucalyptus erythronema, Eucalyptus ficifolia, 
Eucalyptus flocktoniae, Eucalyptus foecunda, Eucalyptus forrestiana, Eucalyptus 
gardneri, Eucalyptus gillii, Eucalyptus globulus, Eucalyptus globulus ssp globulus, 
Eucalyptus globulus ssp. pseudoglobulus, Eucalyptus globulus ssp maidenii, Eucalyptus 
globulus ssp bicostata, Eucalyptus gomphocephala, Eucalyptus gracilis, Eucalyptus 
grandis, Eucalyptus griffithsii, Eucalyptus grossa, Eucalyptus henrii, Eucalyptus 
increassata, Eucalyptus kitsoniana, Eucalyptus kondininensis, Eucalyptus kruseana, 
Eucalyptus lansdowneana, Eucalyptus largiflorens, Eucalyptus lehmanni, Eucalyptus 
leucoxylon, Eucalyptus leucoxylon (Pink) (syn. 'Rosea'), Eucalyptus lesouefii, Eucalyptus 
macrandra, Eucalyptus macrocarpa, Eucalyptus maculata, Eucalyptus marginata, 
Eucalyptus melliodora, Eucalyptus microcarpa, Eucalyptus nicholii, Eucalyptus nitens, 
Eucalyptus nutans, Eucalyptus obliqua, Eucalyptus occidentalis, Eucalyptus ochrophloia, 
Eucalyptus oleosa, Eucalyptus orbifolia, Eucalyptus ovata, Eucalyptus pauciflora, 
Eucalyptus parvifolia, Eucalyptus perriniana, Eucalyptus pileata, Eucalyptus pellita, 
Eucalyptus pilularis, Eucalyptus platypus, Eucalyptus polyanthemos, Eucalyptus porosa, 
Eucalyptus preissiana, Eucalyptus pyriformis, Eucalyptus regnans, Eucalyptus resinifera, 
Eucalyptus rhodantha, Eucalyptus robusta, Eucalyptus saligna, Eucalyptus 
salmonophloia, Eucalyptus salubris, Eucalyptus sargentii, Eucalyptus sepulcralis, 
Eucalyptus sideroxylon 'Rosea', Eucalyptus sieberi, Eucalyptus spathulata, Eucalyptus 
socialis, Eucalyptus steedmanii, Eucalyptus stoatei, Eucalyptus stricklandii, Eucalyptus 
tereticornis, Eucalyptus tetragona, Eucalyptus tetraptera, Eucalyptus torquata, 
Eucalyptus 'Torwood', Eucalyptus urnigera, Eucalyptus urophylla, Eucalyptus viminalis, 
Eucalyptus viridis, Eucalyptus wandoo, Eucalyptus websteriana, Eucalyptus woodwardii. 

15. A method according to claim 14 wherein the species of Eucalyptus is selected 
from Eucalyptus benthamii, Eucalyptus botryoides, Eucalyptus camaldulensis, Eucalyptus 
citriodora, Eucalyptus cloeziana, Eucalyptus delegatensis, Eucalyptus diversicolor, 
Eucalyptus dunnii, Eucalyptus globulus, Eucalyptus globulus subspecies bicostata, 
Eucalyptus globulus subspecies globulus, Eucalyptus globulus subspecies maidenii, 
Eucalyptus globulus subspecies pseudoglobulus, Eucalyptus grandis, Eucalyptus henrii, 
Eucalyptus maculata, Eucalyptus marginata, Eucalyptus nitens, Eucalyptus obliqua, 
Eucalyptus pellita, Eucalyptus pilularis, Eucalyptus regnans, Eucalyptus resinifera, 
Eucalyptus saligna, Eucalyptus sieberi, Eucalyptus tereticornis, Eucalyptus urophylla, 
Eucalyptus viminalis. 

16. A method according to any one of claims 12 to 15 wherein the species of 
Eucalyptus is Eucalyptus globulus ssp globulus or Eucalyptus nitens. 

17. A method according to any one of claims 12 to 16 wherein the reproductive gene 
corresponds to one or more of SEQ ID NO:1 or 3 or 5, SEQ ID NO:9 or 11 or 13 
and/or SEQ ID NO:7 or 15. 

18. A method according to any one of claims 12 to 17 wherein the resulting plant is 
substantially sterile. 

19. A transgenic woody perennial plant which exhibits a substantial reduction in 
reproductive capability. 

20. A transgenic plant according to claim 19 wherein said plant is substantially 
sterile. 

21. A transgenic plant according to claim 19 or 20 wherein said woody perennial is 
a species of Eucalyptus. 

22. A transgenic plant according to claim 21 wherein the species of Eucalyptus is 
selected from Eucalyptus alpina, Eucalyptus annulata, Eucalyptus astringens, Eucalyptus 
behriana, Eucalyptus benthamii, Eucalyptus botryoides, Eucalyptus brockwayi, 
Eucalyptus caesia, Eucalyptus calophylla 'Rosea', Eucalyptus calycogona, Eucalyptus 
camaldulensis, Eucalyptus camaldulensis ssp. subcinerea, Eucalyptus campaspe, 
Eucalyptus camphora, Eucalyptus citriodora, Eucalyptus cladocalyx, Eucalyptus 
cladocalyx 'Nana', Eucalyptus cloeziana, Eucalyptus cornuta, Eucalyptus cosmophylla, 
Eucalyptus crenulata, Eucalyptus crucis, Eucalyptus delegatensis, Eucalyptus 
desmondensis, Eucalyptus dielsii, Eucalyptus diptera, Eucalyptus diversicolor, Eucalyptus 
diversifolia, Eucalyptus dumosa, Eucalyptus dundasii, Eucalyptus dunnii, Eucalyptus 
eremophila, Eucalyptus erythrocorys, Eucalyptus erythronema, Eucalyptus ficifolia, 
Eucalyptus flocktoniae, Eucalyptus foecunda, Eucalyptus forrestiana, Eucalyptus 
gardneri, Eucalyptus gillii, Eucalyptus globulus, Eucalyptus globulus ssp globulus, 
Eucalyptus globulus ssp. pseudoglobulus, Eucalyptus globulus ssp maidenii, Eucalyptus 
globulus ssp bicostata, Eucalyptus gomphocephala, Eucalyptus gracilis, Eucalyptus 
grandis, Eucalyptus griffithsii, Eucalyptus grossa, Eucalyptus henrii, Eucalyptus 
increassata, Eucalyptus kitsoniana, Eucalyptus kondininensis, Eucalyptus kruseana, 
Eucalyptus lansdowneana, Eucalyptus largiflorens, Eucalyptus lehmanni, Eucalyptus 
leucoxylon, Eucalyptus leucoxylon (Pink) (syn. 'Rosea'), Eucalyptus lesouefii, Eucalyptus 
macrandra, Eucalyptus macrocarpa, Eucalyptus maculata, Eucalyptus marginata, 
Eucalyptus melliodora, Eucalyptus microcarpa, Eucalyptus nicholii, Eucalyptus nitens, 
Eucalyptus nutans, Eucalyptus obliqua, Eucalyptus occidentalis, Eucalyptus ochrophloia, 
Eucalyptus oleosa, Eucalyptus orbifolia, Eucalyptus ovata, Eucalyptus pauciflora, 
Eucalyptus parvifolia, Eucalyptus perriniana, Eucalyptus pileata, Eucalyptus pellita, 
Eucalyptus pilularis, Eucalyptus platypus, Eucalyptus polyanthemos, Eucalyptus porosa, 
Eucalyptus preissiana, Eucalyptus pyriformis, Eucalyptus regnans, Eucalyptus resinifera, 
Eucalyptus rhodantha, Eucalyptus robusta, Eucalyptus saligna, Eucalyptus 
salmonophloia, Eucalyptus salubris, Eucalyptus sargentii, Eucalyptus sepulcralis, 
Eucalyptus sideroxylon 'Rosea', Eucalyptus sieberi, Eucalyptus spathulata, Eucalyptus 
socialis, Eucalyptus steedmanii, Eucalyptus stoatei, Eucalyptus stricklandii, Eucalyptus 
tereticornis, Eucalyptus tetragona, Eucalyptus tetraptera, Eucalyptus torquata, 
Eucalyptus 'Torwood', Eucalyptus urnigera, Eucalyptus urophylla, Eucalyptus viminalis, 
Eucalyptus viridis, Eucalyptus wandoo, Eucalyptus websteriana, Eucalyptus woodwardii. 



23. A transgenic plant according to claim 22 wherein the species of Eucalyptus is 
selected from Eucalyptus benthamii, Eucalyptus botryoides, Eucalyptus camaldulensis, 
Eucalyptus citriodora, Eucalyptus cloeziana, Eucalyptus delegatensis, Eucalyptus 
diversicolor, Eucalyptus dunnii, Eucalyptus globulus, Eucalyptus globulus subspecies 
bicostata, Eucalyptus globulus subspecies globulus, Eucalyptus globulus subspecies 
maidenii, Eucalyptus globulus subspecies pseudoglobulus, Eucalyptus grandis, 
Eucalyptus henrii, Eucalyptus maculata, Eucalyptus marginata, Eucalyptus nitens, 
Eucalyptus obliqua, Eucalyptus pellita, Eucalyptus pilularis, Eucalyptus regnans, 
Eucalyptus resinifera, Eucalyptus saligna, Eucalyptus sieberi, Eucalyptus tereticornis, 
Eucalyptus urophylla, Eucalyptus viminalis. 

24. A transgenic plant according to any one of claims 19 to 23 wherein the species 
of Eucalyptus is Eucalyptus globulus ssp globulus or Eucalyptus nitens. 

25. A method for identifying a gene required for normal flower development in a 
woody perennial, said method comprising isolating a nucleic acid molecule from said 
woody perennial and cloning a region of said nucleic acid molecule having at least 45% 
nucleotide similarity to all or a region of Arabidopsis genes LFY, AP1 or AG or 
Antirrhinum genes FLO, SQUA or PLE. 

26. A method according to claim 25 wherein the woody perennial is a species of the 
genus Eucalyptus. 



27. A method according to claim 26 wherein the species of Eucalyptus is selected 
from Eucalyptus alpina, Eucalyptus annulata, Eucalyptus astringens, Eucalyptus 
behriana, Eucalyptus benthamii, Eucalyptus botryoides, Eucalyptus brockwayi, 
Eucalyptus caesia, Eucalyptus calophylla 'Rosea', Eucalyptus calycogona, Eucalyptus 
camaldulensis, Eucalyptus camaldulensis ssp. subcinerea, Eucalyptus campaspe, 
Eucalyptus camphora, Eucalyptus citriodora, Eucalyptus cladocalyx, Eucalyptus 
cladocalyx 'Nana', Eucalyptus cloeziana, Eucalyptus cornuta, Eucalyptus cosmophylla, 
Eucalyptus crenulata, Eucalyptus crucis, Eucalyptus delegatensis, Eucalyptus 
desmondensis, Eucalyptus dielsii, Eucalyptus diptera, Eucalyptus diversicolor, Eucalyptus 
diversifolia, Eucalyptus dumosa, Eucalyptus dundasii, Eucalyptus dunnii, Eucalyptus 
eremophila, Eucalyptus erythrocorys, Eucalyptus erythronema, Eucalyptus ficifolia, 
Eucalyptus flocktoniae, Eucalyptus foecunda, Eucalyptus forrestiana, Eucalyptus 
gardneri, Eucalyptus gillii, Eucalyptus globulus, Eucalyptus globulus ssp globulus, 
Eucalyptus globulus ssp. pseudoglobulus, Eucalyptus globulus ssp maidenii, Eucalyptus 
globulus ssp bicostata, Eucalyptus gomphocephala, Eucalyptus gracilis, Eucalyptus 
grandis, Eucalyptus griffithsii, Eucalyptus grossa, Eucalyptus henrii, Eucalyptus 
increassata, Eucalyptus kitsoniana, Eucalyptus kondininensis, Eucalyptus kruseana, 
Eucalyptus lansdowneana, Eucalyptus largiflorens, Eucalyptus lehmanni, Eucalyptus 
leucoxylon, Eucalyptus leucoxylon (Pink) (syn. 'Rosea'), Eucalyptus lesouefii, Eucalyptus 
macrandra, Eucalyptus macrocarpa, Eucalyptus maculata, Eucalyptus marginata, 
Eucalyptus melliodora, Eucalyptus microcarpa, Eucalyptus nicholii, Eucalyptus nitens, 
Eucalyptus nutans, Eucalyptus obliqua, Eucalyptus occidentalis, Eucalyptus ochrophloia, 
Eucalyptus oleosa, Eucalyptus orbifolia, Eucalyptus ovata, Eucalyptus pauciflora, 
Eucalyptus parvifolia, Eucalyptus perriniana, Eucalyptus pileata, Eucalyptus pellita, 
Eucalyptus pilularis, Eucalyptus platypus, Eucalyptus polyanthemos, Eucalyptus porosa, 
Eucalyptus preissiana, Eucalyptus pyriformis, Eucalyptus regnans, Eucalyptus resinifera, 
Eucalyptus rhodantha, Eucalyptus robusta, Eucalyptus saligna, Eucalyptus 
salmonophloia, Eucalyptus salubris, Eucalyptus sargentii, Eucalyptus sepulcralis, 
Eucalyptus sideroxylon 'Rosea', Eucalyptus sieberi, Eucalyptus spathulata, Eucalyptus 
socialis, Eucalyptus steedmanii, Eucalyptus stoatei, Eucalyptus stricklandii, Eucalyptus 
tereticornis, Eucalyptus tetragona, Eucalyptus tetraptera, Eucalyptus torquata, 
Eucalyptus 'Torwood', Eucalyptus urnigera, Eucalyptus urophylla, Eucalyptus viminalis, 
Eucalyptus viridis, Eucalyptus wandoo, Eucalyptus websteriana, Eucalyptus woodwardii. 

28. A method according to claim 27 wherein the species of Eucalyptus is selected 
from Eucalyptus benthamii, Eucalyptus botryoides, Eucalyptus camaldulensis, Eucalyptus 
citriodora, Eucalyptus cloeziana, Eucalyptus delegatensis, Eucalyptus diversicolor, 
Eucalyptus dunnii, Eucalyptus globulus, Eucalyptus globulus subspecies bicostata, 
Eucalyptus globulus subspecies globulus, Eucalyptus globulus subspecies maidenii, 
Eucalyptus globulus subspecies pseudoglobulus, Eucalyptus grandis, Eucalyptus henrii, 
Eucalyptus maculata, Eucalyptus marginata, Eucalyptus nitens, Eucalyptus obliqua, 
Eucalyptus pellita, Eucalyptus pilularis, Eucalyptus regnans, Eucalyptus resinifera, 
Eucalyptus saligna, Eucalyptus sieberi, Eucalyptus tereticornis, Eucalyptus urophylla, 
Eucalyptus viminalis. 

29. A method according to any one of claims 25 to 28 wherein the species of 
Eucalyptus is Eucalyptus globulus ssp globulus or Eucalyptus nitens. 

30. An isolated nucleic acid molecule according to any one of claims 1 to 11 or a 
method according to any one of claims 12 to 18 or a transgenic woody perennial 
according to any one of claims 19 to 24 or a method according to any one of claims 25 
to 29 substantially as hereinbefore described with reference to the Figures and/or 
Examples. 

ABSTRACT 

The present invention relates generally to plants and more particularly to woody 
perennial plants such as eucalypts with reduced reproductive capacity. The present 
invention is also directed to genetic constructs useful in the generation of woody 
perennial plants with reduced reproductive capacity. The present invention is most 
particularly directed to sterile trees of the genus Eucalyptus which are useful in the 
establishment of wood lot plantations, in re-forestation projects and in developing 
modified eucalyptus trees with beneficial properties. 
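
The similarity threshold recited in claim 25 (at least 45% nucleotide similarity to all or a region of a reference gene) can be illustrated with a short calculation. The following is a minimal sketch only: it assumes the two regions have already been aligned to equal length, it treats similarity as simple percent identity with gap columns counted as mismatches, and the function names and example sequences are hypothetical rather than taken from the specification.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of identical positions between two pre-aligned sequences.

    Both sequences must have the same (non-zero) length. Alignment gaps
    are written as '-' and always count as mismatches.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1
        for a, b in zip(seq_a.upper(), seq_b.upper())
        if a == b and a != "-"
    )
    return 100.0 * matches / len(seq_a)


def meets_threshold(seq_a: str, seq_b: str, threshold: float = 45.0) -> bool:
    """True when the aligned region reaches the stated similarity threshold."""
    return percent_identity(seq_a, seq_b) >= threshold


# Hypothetical aligned fragments, for illustration only.
candidate = "ATGGGGAGGGGA"
reference = "ATGGGAAGAGGT"
print(round(percent_identity(candidate, reference), 1))
print(meets_threshold(candidate, reference))
```

In practice the alignment step itself would be carried out with a dedicated alignment tool; the sketch covers only the scoring of an alignment that already exists.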



-120 CCACTACGTACCTACATACACTGTACATTCAATATACTAAACACATCCACCC^TCTCTTT 

-60 ATAAAACAAGCAACCCACACTCCTCTACCCTTACACCCTCAACCCATTTGAACCCCGACA 

0 ATGGATCCAGAACCATTTGCGGTTGTGGGGTTGCGAACGATGCGGGGACTCGAGGAGCTG 

0 MDPEAFAVVGLRTMCGLEEL 

61 TTCGAGCCTTATGGCATAAGGTACCTCACGGCCTCCAGGATAGCCCAAATGGGGTTTACC 

21 FEAYG I RYLTASR I AEMC.^T 

1 2 1 CCCAACACCCTCCTCG ACATGAAGGAGGACGACCTCGACCACATCATCAACTCCCTCTCC 

41 ANTLLDMKEEELDDMMNSLS 

181 CACATCTTCCCCTCGGACCTCCTCCTCCCCGACCCCTACGCCATCAAGGCCGCCATCCCC 

61 HIFRWDLLVCERYCIKAAIR 

241 GCCCAGCGCCCACCCCTCCTCCAAGCCGATGACCGCCGCCACCACCTCCACTCCACCCAC 

81 AERRRLLEADDRRHHLHSTD 

301 CATCCCCTCCTCCATGCTCTCTCCCACCAAGGTACCTTACCTCGGCCTCTTTCATTTTTC 

101 HALLDALSHQC 

361 ACCTTTTGGGTCATTATGATTATTCCCACGCAGGGCTGTCGGAGGAACAAGTGGTCCACC 

112 LSEEQVVQH 

421 ACTCACACAACCATCACCTCCCCACGCCCCCAACCCCCGACACCCCCGCCACC TCCTCCC 

121 SEKDQLCRACSCDTACTSWC 

481 CCCCCCAACAACAGAGAAACAAGCATCGTCATCCTCACCACATCACCGCGATCAAAGCAG 

141 AQQQR-CKHRHRHHITAMKGA 

641 CGGCCACGGAAGAGGACGAGGAGGACCAGGAGCAAGTGGAGGAGATGAGGAGGCAGAGGC 

161 ATEEDEEDEEEVEEMRRQRE 

701 ACCACCCCTTCATACTCACCCACCCrCCCGACCTCCCCCCTCCGAACAACAACCGCCTCC 

181 HPF I VTEPGEVARGKKNGLD 

7 6 1 ACTACCTCTTCC ATCTCTACGACC AGTGCCGCGACTTCCTCCTCCAACTCCAATCCTTGC 

201 YLFHLYDQCRDFLLQVQSLA 

82.1 CCAAGGAGCGCCGCGACAAATGCCCCACCAACCTCCTCCGCCTCTCTTCTTTTTATAGTC 

221 KERCEKCPTK 

881 - TTTCGATAC AATC AAACTTCTTCGAACGTACCCAAGCACGCATCATTCAAACAACAAGAC 

941 AATCCACGTGACGAACCAGCTCTTCAGGTACGCGAAGAAGGCGGGAGCAAGCTACATAAA 

231 VTNQVFRYAKKAGASY I N 

1001 CAAGCCCAAGATGAGGCACTATGTCCACTCCTACGCCCTGCACTGCCTCCACCAGCACCC 

249 K P K M B H Y V H C V A LHCLDEHA 

LFY-309-H 

1061 CTCCAACGCCCTTCCCAACACCTTCAACGAGCGCCCCGAGAACCTCGGCGCCTCCACCCA 

269 S NALRKSFKERCENVGA W R Q 

1121 AGCCTCTTACCACCCCCTGCTCACCATCCCCGCCCGCAGGGCCCGCTGGGACATCGACGC 

289 A C 1 U E_ L. VTIAGRRAGWD I DA 

LFY-351-J 

1181 C ATCTTCAATGCCC ACCCCCGCCTCTCC ATCTCGTATCTCCCCACC AAGCTCCGCCAGCT 

309 I FNAHPRLCIWYVPTKLRQL 

1241 CTGCCATGCTCACCCCCACTCCTCCGCCTCTGCTGCTTCCTCCGCTTCCACCTCCACCTC 

329 CHAHRHSSASAASSASTSTS 

1301 TCCCCCCACCCCTCACCATCTCCAACTCCCTTACTAGTTCGTGCCCCTTCCTCATTCTCT 

349 APTAHHLELPY stop 359 

1361 TCTCTTTTACTCTCCTATC ACCCC ATC ATC ATG AGCG AGATGAAAGTACTCTGCTGTGGT 

1441 CTATTTCTACTGCAATGAGTATTAATTGAATTTTAACGTGTAGCTGAGCTGAGCTGAGCT 

CGCCCCCCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCCTTCAAGAACACACCAAAGAAC 

ACACACCCATCTTAACCTCTTACTATCTCCACTCTTATTCTTTTCATATTTTTCTCCAAA 

CCATCCTCGTCCCrAAnCTTTTTTTCCCCCACTATCCACTTCCCCAGTCAA^TTTCACAC 

MEFPSEFSE 9 

CCCTCTTCACAGAAGAGAATCGGCGCGAGACCCAAAATAGAGATCAAACCCATCCAGAAC 
ASSQKRIG G R G K I E I K B I E_U . 29 

ACGACGAACCCGCAGGTCACCTTTTrTAAACGCCCGAACGGCTTGTTCAACAAGCCTTAT 
TTNRQVTFC K EL R N G LL KK A 1 ^9 

cagctatccctcttgtgcgatcctcaag;cccccttattctcttctccacccgtgccacc 

El. SVf.CnAEVALIVFSS B G & 69 

CTCTATGAATATGCTAACAACACTC i L> \GA 3C AAC AATTCAGAGCTACAAGAAACC AACC 

I. Y F Y ANNSV IGTI ERYKKAS 89 

ACTCATTCCTCACATCCACACTCCCTTTCTCAAGTCAACACTCACTTTTATCACCAACAA 
SDSSHPQSVSEVNTQFYQQE 109 

GCATCCAAGCTTCCGACACAGATAAGACAAATCCAGGTCTCAAATAGCCATATTCTAGGT 
ASKLRRQ I REIQVSNRHI L 0 .29 

GACCCTATAACTCATTTCACCTTCAACCATCTCAACAATCTCCACACCAAATTAGAGAAA 
EC ISDLSFKDLKNLESKLEiC i-9 

TCCATCACCCGACTTACATCAAACAACAATCACATCCTTTTTGCCGACATTCACTACATC 

S I SRVRSKKNEMLFAEIEYM 169 

CACAACAGGGAAATTCAGCTGCAAAATCATAACATGTATCTCAGAGCAAAGATAGCTGAG 
QKREI ELQNDNMYLRAK I A E 189 

AC- 1 

AACCACACACCACAACACCACCACCACCAACCGACTCATCATCACTTCAACATCCCCCCA 

M F R AQQQQQQGSDHHFNMPC 209 

TCCTCGTCGGTCTACGACGCCCTCCCGTCTCACCCTCCATACCATCGCAACTTCCTCCAA 
SSSVYEALPSQPAYDRNFLQ 229 

CTCAATGTCCTCCAACCAAATCACCAATCTTATTCTCGATCCGACCACACTGCCCTCCAA 
VNVLEPNHQSYSRS D H T A L Q 2u9 

AG-2 

CTCCTCTAATTAAAGCTCACTTAATATGTAGCACTACCAACTGGACCTATTGCCACTCTA 
L V stop 251 

GATTATCCTTTTTAATATCCTACACTTAGTTGAAAGACGAATCAATGAACTGCCGACAGA 
CCCGCATCATGAGTTTCCCCTAAACAGTGGAAGAACTCTCTCATATTTCTCTTCATCACT 
CTTCAAACTCTTCTCATCAATCCTCTTCTACAACCGTTATTCCAATTTTTATATACTACT 
TAA 

TAACCCCCCATCTCTCTTCTCTCTCTCTCTCTCIC.XTCCCTTTCTCTTTTCTTTTCTTr 

CTTCCCCTTTTCCCGTCTCTCTTCCCrTCTCT^TICCACCCACCAT^CCCCCAC 

M G B. C q 

^ Lj^O^r^if^o^^^r^^^^^^^^CACAACTCA CCTTCTCCAAGACCAC 

AATTCTCCMCCCTATCAAACATACTCATATCCCCAC^ 
CGAATCCA™CTAGCTOGACmcCAGCATGCT r c^^ 

^H*^R^*^^*^g^^*^p^jJ^^^^GATCTTCATTCT^CACTCTC^CCACC 

c ^7 Tc rrrrTT'A =c T iT r„" c f4^T o r T 5' 

CCACCACCAACACCATGCCCTTCACTCACCTCTTCnp 
TCACCCTCCCCCTCCTACACTTCTTCCTCCOT^^ 

acataaaacgcmcitctgctgctatatctttgataaccm^tLLt^a™ 
attccatcgcataatctcaccctctcccccacccttcttctaccgaaataaacatgtccc 
tgctttttgctgcattcagtactcctaagtgtcctcctccctctatgtataccaccattc 
caaatatgcataccaatgtacatgtatgcaaaattatatatgcaagttaacacaaccata ' 

AAAA 



CT 

tcgaacagttcagaaagacggatagatacatacacagagacatcgattgcatcatcttct 

ctacctcattgtgcgtgcggtgcgtgcggtgcgtgctttatatagacgctcgctcttacg 

ttgttctcgtccgcccggtggtgtggtggttggagagagagggagagagagtagagagag 

aaaATGGGGAGGGGGAGGGTGCAGCTGAAGAGGATAGAGAACAAGATAAACAGGCAAGTG 
MGRGRVQLKRIENKINRQV 

ACGTTCTCGAAGCGGAGATGCGGGCTGCTGAAGAAGGCCCACGAGATCTCCGTCCTCTGC 
TFSKRRCGLLKKAHEISVLC 

GACGCCGACGTCGCCCTCATCGTCTTCTCCACCAAGGGCAAGCTCTTCGAGTACGCCACC 
DADVALIVFSTKGKLFEYAT 

GACTGTTGCATGGAGAGGATCCTCGAGCGTTATGAGAGATATTCATATGCAGAGAGCCAG 
DCCMERILERYERYSYAESQ 

GTTCTCACAAACAATGCCGAAACCAATGGGAACTGGACTTTGGAACATGCAAAACTCAAG 
VLTNNAETNGNWTLEHAKLK 

GCCAGGATGGAGATTCTGCAGAAAAATCAAAAGAATCTGATGGGAGAAGAACTCGATTCT 
ARMEILQKNQKNLMGEELDS 

CTAAGCCTCAAAGAGCTTCAGAATTTGGAGCATCAGCTTGACACTGCTCTTAAGAACATT 
LSLKELQNLEHQLDTALKNI 

CGGTCTAGAAAGATTCAACTCATGTGTGAATCCATCTCTGAGCTTCAGAGAAAGGATAAG 
RSRKIQLMCESISELQRKDK 

GCGCTGCAAGAACAAAACAACATGCTGGCAAAGAAGGTGAAAGAGAAGGAGAAGGCACTG 
ALQEQNNMLAKKVKEKEKAL 

GCACAGCAAACTCAATGGGACAACCCGCAAGATGATGGCCTCACCTCTTCCTCGGGGTCC 
AQQTQWDNPQDDGLTSSSGS 

TTACCATCCGAGTGGAATTGAggagggagcggccctcggtccacctcaacatcgtaatgc 
LPSEWN* 

caccctttttccttcatggatgcttagccacctccaagcctaattgtccaatttatttgc 
atgggaaccatatatcattaccaatattatctaatgcttcttatcgtatatatccagtcg 
atatgtaaccataatgtaacatgtcttagtacatgcattgtggggatacttgcaaaaacc 

ttgttttgtagg 

tcgaacagttcagaaagacggatagatacatacacagagacatcgattgcatcatcttct 

ctacctcattgtgcgtgcggtgcgtgcggtgcgtgctttatatagacgctcgctctcacg 

ttgttctcgtccgcccggtggtgtggtggttggagagagagggagagagagtagagagag 

aaaATC^AGGG^AGGGTGCAGCTGAAGAGGATAGAGAACAAGATAAACAGGCAAGTG 
MGRGRVQLKRIENKINRQV 

ACGTTCTCGAAGCGGAGATGC^GCTGCTGAAGAAG^CCACGAC^TCTCCGTCCTCT^ 
rSKRRCGLLKKAHEISVLC 

GACGCCGACGTCGCCCTCATCGTCTTCTCCACCAAGGGCAAGCTCTTCGAGTACGCCACf™ 
"ADVALIVFSTKGKLFEYAT 

GACTGTTGCATGGAGAGGATCCTCGAGCGTTATGAGAGATATTCATATGCAGAGAGCCAG 
DCCMERILERYERYSYAESQ 

^CTCACAAACA^ 
VLTNNAETNGNWTLEHAK LK 

GCCAGGATGGAGATTCTGCAGAAAAATCAAAAGAATCTGATGGGAGAAGAACTCGATTCT 
ARMEILQKNQ KNLMGEELDS 

CTAAGCCTCAAAGA^ 
^SLKE LQNLEHQLDTALKNI 

CGGTCTAG^^CAACTCATGTGTGAATCCATCTCTGAGCTTCAGAGAAA^ 
"SRKIQLMCESISELQRKDK 

GCGCTGCAAGAACAAAACAACATGCTGGCAAAGAAGGTGAAAGAGAAGGAGAAGGCACTG 
. .. . . .f? L A K K V K E K E K A L 

^ AC ^^ TC ^ T ^ CAA CCC«:AAGATC^TG<SCCTCACCTCTTCCTC(SGrrA^ 
AQQTQWDNPQDDGLTSSSV I 

CTCTCTCAATCITTGCAGCCTCTCAACATTGG^TC™^ 
^fU&LQP LNIGGPYHPSGIE 

GAGG^GCGGCCCTCGCTCCACCTCAACATCGTAATGCCACCCTTTTTCCTTCATGGATG 
^AALGPPQHRNATLF PSWM 

CTTAGCCACCTCCAAGAATAAtgatccaatttatttgcatggga 



accatatatcattacc 



gtcttagtacatgcattgtggggatacttgcaaaaaccttgttttgtagg 



aacat 
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GGAATTCGCG GCCGCTGAAA GGGCATTCGT TTCATTAATT CATGTCTCCG CTCATCTCAA 60 

CTCGAACAGT TCAGAAAGAC AGATAGATAC ATACACAGAG ACAT^GATTG CATCATCTTC 120 

TCTACCTCAT TGTGwTCiCG GTGCGTGCGG TGCGTGCTTT ATATAGACGC TCGCTCTTAC 180 

GTTGTTCTCG TCCGCCCGGT GGTGTGGTGG TTGGAGAGAG AGGGAGAGAG AGTAGAGAGA 24 0 

GAAA ATG GGG AGG GGG AGG GTG CAG CP" AAG AGG ATA GAG AAC AAG ATA 289 
Met Gly Arg Gly Arg Val Gin Leu Lys Arg lie Glu Asn Lys He 
15 10 15 

AAC AGG CAA GTG ACG TTC TCG AAG CGG AGA TGC GGG CTG CTG AAG AAG 337 
Asn Arg Gin Val Thr Phe Ser Lys Arg Arg Cys Gly Leu Leu Lys Lys 

20 25 30 

GCC CAC GAG ATC TCC GTC CTC TGC GAC GCC GAC GTC GCC CTC ATC GTC 385 
Ala His Glu He Ser Val Leu Cys Asp Ala Asp Val Ala Leu He Val 
35 40 45 

TTC TCC ACC AAG GGC AAG CTC TTC GAG TAC GCC ACC GAC TGT TGC ATG 4 33 

Phe Ser Thr Lys Gly Lys Leu Phe Glu Tyr Ala Thr Asp Cys Cys Met 
50 55 60 

GAG AGG ATC CTC GAG CGT TAT GAG AGA TAT TCA TAT GCA GAG AGC CAG 481 
Glu Arg He Leu Glu Arg Tyr Glu Arg Tyr Ser Tyr Ala Glu Ser Gin 
65 70 75 

GTT CTC ACA AAC AAT GCC GAA ACC AAT GGG AAC TGG ACT TTG GAA CAT 52 9 

Val. Leu Thr Asn Asn Ala Glu Thr Asn Gly Asn Trp Thr Leu Glu His 
80 85 90 . 95 

GCA AAA CTC AAG GCC AGG ATG GAG ATT CTG CAG AAA AAT CAA AAG AAT 577 
Ala Lys Leu Lys Ala Arg Met Glu . He Leu Gin Lys Asn Glh Lys Asn 

100 105 110 

CTG ATG GGA GAA GAA CTC GAT TCT CTA AGC CTC AAA GAG CTT CAG AAT 625 
Leu Met Gly Glu Glu Leu Asp Ser Leu Ser Leu Lys Glu Leu Gin Asn 
115 120 125 

TTG GAG CAT CAG CTT GAC ACT GCT CTT AAG AAC ATT CGG TCT AGA AAG 673 
Leu Glu His Gin Leu Asp Thr Ala Leu Lys Asn He Arg Ser Arg Lys 
130 135 140 

ATT CAA CTC ATG TGT GAA TCC ATC TCT GAG CTT CAG AGA AAG GAT AAG 721 
He Gin Leu Met Cys Glu Ser He Ser Glu Leu Gin Arg Lys Asp Lys 
145 150 155 

GCG CTG CAA GAA CAA AAC AAC ATG CTG GCA AAG AAG GTG AAA GAG AAG 76 9 

Ala Leu Gin Glu Gin Asn Asn Met Leu Ala Lys Lys Val Lys Glu Lys 
160 165 170 175 

GAG AAG GCA CTG GCA CAG CAA ACT CAA TGG GAC AAC TCG CAA GAT GAT 817 
Glu Lys Ala Leu Ala Gin Gin Thr Gin Trp Asp Asn Pro Gin Asp Asp 

180 185 190 

GGC CTC ACC TCT TCC TCG GGG TCC TTA CCA TCC GAG TGG AAT TGA GGA 865 
Gly Leu Thr Ser Ser Ser Gly Ser Leu Pro Ser Glu Trp Asn * Gly 
195 200 205 

GGG AGC GGC CCT CGG TCC ACC TCA ACA TCG TAA TGC CAC CCT TTT TCC 913 
Gly Ser Gly Pro Arg Ser Thr Ser Thr Ser * Cys His Pro Phe Ser 
210 215 220 

TTC ATG GAT GCT TAG CCA CCT CCA AGA A TAATGATCCA ATTTATTTGC 961 
Phe Met Asp Ala * Pro Pro Pro Arg 
225 230 

ATGGGAACCA TATATCATTA CCAATAT TAT CTAATGCTTC TTATCGTATA TATCcAviTCG 1021 

ATATGTAACC ATAATGTAAC ATGTCTTAGT ACATGCATTG TGGGGATACT TGCAAAAACC 1081 



TTGTTTTGTA GG 



1093 
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GCTTCCATCT CCAGGGAAGA GTTTTGAACT TGCGACC ATG GTT TTT CCA ACC CAA 5 5 

Met Val Phe Pro Thr Gin 
1 5 

GCC ACG CCC GAG GAG TCC CCG CAG AGG AAA ATG GGG AGG GGA AAG ATC 103 
Ala Thr Pro Glu Glu Ser Pro Gln Arg Lys Met Gly Arg Gly Lys Ile 
10 15 20 

GAG ATC AAG CGG ATC GAG AAC ACG ACG AAT CGG CAA GTG ACT TTC TGC 151 
Glu Ile Lys Arg Ile Glu Asn Thr Thr Asn Arg Gln Val Thr Phe Cys 
25 30 35 

AAG CGG CGG AAT GGC CTC CTC AAG AAG GCA TAT GAA CTC TCC GTT CTT 199 
Lys Arg Arg Asn Gly Leu Leu Lys Lys Ala Tyr Glu Leu Ser Val Leu 
40 45 50 

TGC GAA CGC GAG GTC GCC CTC ATC GTC TTC TCC AGC CGC GGT CGC CTC 247 
Cys Glu Arg Glu Val Ala Leu Ile Val Phe Ser Ser Arg Gly Arg Leu 
55 60 65 70 

TAT GAG TAT GCC AAC GAT AGT GTC AAA GCA ACC AT*~ GAG AGG TAC AAG 295 
Tyr Glu Tyr Ala Asn Asp Ser Val Lys Ala Thr Ile Glu Arg Tyr Lys 
75 80 85 

AAG GCT TTC TCA GAT TCC TCC AGT AGC GGG ATC CGT TCT GAA GCT AAT 343 
Lys Ala Phe Ser Asp Ser Ser Ser Ser Gly Ile Arg Ser Glu Ala Asn 
90 95 100 

GTT CAG TTT TAT CAG CAA GAA TCC GCA AAG TTG CAA CAA CAG ATT AAT 391 
Val Gln Phe Tyr Gln Gln Glu Ser Ala Lys Leu Gln Gln Gln Ile Asn 
105 110 115 

AAC ATG CAG AAC AAT AAC AGG CAA CTG GTG GGT GAC TCA ATT GCT GGG 439 
Asn Met Gln Asn Asn Asn Arg Gln Leu Val Gly Asp Ser Ile Ala Gly 
120 125 130 

ATG AAT ATG AAG GAT ATG AAG ACT ACG GAG CAA AAA CTA GAA AAA GCA 487 
Met Asn Met Lys Asp Met Lys Thr Thr Glu Gln Lys Leu Glu Lys Ala 
135 140 145 150 

ATC GCT AAA ATT CGC GCC AAA AAG AAT GAG CTT CTA TTT GCG GAG ATT 535 
Ile Ala Lys Ile Arg Ala Lys Lys Asn Glu Leu Leu Phe Ala Glu Ile 
155 160 165 

GAG TAT ATG CAA AAA AGG GAA ATT GAC CTA CAC AAC AAC AAT CAA GTT 583 
Glu Tyr Met Gln Lys Arg Glu Ile Asp Leu His Asn Asn Asn Gln Val 
170 175 180 

CTG CGA GCG AAG ATC GCG GAG AGC GAG AGG TCC CAA CAT GCG GAT ATG 631 
Leu Arg Ala Lys Ile Ala Glu Ser Glu Arg Ser Gln His Ala Asp Met 
185 190 195 

AAT TTG ATG CCG GGA GGT ACC AAC TAT GAC TTC ATG CAG CCA TCT TCT 679 
Asn Leu Met Pro Gly Gly Thr Asn Tyr Asp Phe Met Gln Pro Ser Ser 
200 205 210 

TCT CAG CCT TTT GAT TCC AGA AAC TAC TTC CAA GTG AAC GTG CTG 724 
Ser Gln Pro Phe Asp Ser Arg Asn Tyr Phe Gln Val Asn Val Leu 
215 220 225 

TAGCACCCAA CCACTGTGAG TGTGCACAAG TCTAGTATCG ATTGAAGGGA ATATAACTAG 784 

TATATGGAGC TTCTAAGTGC TTATTATGAG GTTATGATGG ACTTCATCAG ATGTTAATTA 844 

AGTAGAAGCA GTTGGCAGAT GCATCTATTA GGTGAAGGGC CTCCTCATCC TTTTGTTGAT 904 

TAGCATCCTT AGATGTCTTA AAATTTCCAC CATCTTTGTT AAGGTGTGTA TTTGGGTCCC 964 

TATGTACTAA ACTGAATATG AACTATCGAA CTA 997 



